A Reproduced Copy 



(NASA-CR-1 846 87) PROCEEDINGS OF THE 

WORKSHOP ON SPACE TEL E ROBOT ICS, VOLUME 3 
(Jet Propulsion Lab.) 412 p CSCL 09B 


G3/63 


N 8 9- 2 65 41 
— THRU — 
N89-26577 
Unclas 
0188169 


Reproduced for NASA 
by the 

NASA Scientific and Technical Information Facility 


FFNo 672 Aug 65 


This publication was prepared by the Jet Propulsion Laboratory. California Institute 
of Technology, under a contract with the National Aeronautics and Space 
Administration. 


ABSTRACT 


These proceedings report the results of a workshop on space telerobotics 9 
which was held at the Jet Propulsion Laboratory, January 20-22, 1987. 

Sponsored by the NASA Office of Aeronautics and Space Technology (OAST), the 
workshop reflected NASA’s interest in developing new telerobotics technology 
for automating the space systems planned for the 1990s and beyond. The 
workshop provided a window into NASA telerobotics research, allowing leading 
researchers in telerobotics to exchange ideas on manipulation, control, system 
architectures, artificial intelligence, and machine sensing. One of the 
objectives was to identify important unsolved problems of current interest. 

The workshop consisted of surveys, tutorials, and contributed papers of both 
theoretical and pratical interest. Several sessions were held with the themes 
of sensing and perception, control execution, operator interface, planning and 
reasoning, and system architecture. Discussion periods were also held in each 
of these major topics. 
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t. Abstract 

It is prepowd that tbs afaOxty to damify pwhkai by type 
sad solution techniques by affUab&y is shinti f w far 
any problem ahcr. Yet no dnsfiadas schemes an nsl- 
abte in the important anas of robotic p fe nning and acbedul- 
ing. A sranrtaid ap p ro a ch aa d e v ri op m g schema broad an 
work rep orte d in the l i ter atur e is outlined. bat is fosnd to 
be rather onsstaf ymg. An alternative approach is sketched 

based upon the ancertasnxy inherent in d ealing with the teal 
world, and is frond to espeaxe man of the in t niti ve dif- 
ferences ben re es planning and srha d aling. A trial iiplaiee 

then offered, the troth of three teroerch groups on ro boti c 
aroembty is given as ropparc for the di s tin ctions drawn. 


1 Introduction 

Arguments axe iatxadneed in this action in the onntrit of 
providing penpectiv e and m o ti v ati on for the rtienisnfsi which 
follows. The topes include oar ap proa ch e s to the shady of 
artificial intell ig e n c e and caeepnfier science as wai ro ro bo tics . 

In some fields of ready, it is ptroihlr. anally after the eft- 
penditnre of great effort, to synthrooe Taws* which describe 
various phenonens of intense. Throe laws can be road to 
explain the remits of past riprrimr s t s or to predict the ant- 
come of future trials. Use to explain and predict can farther 
confirm the a cc ur acy and generality of the laws, or highlight 
Decenary r e vision and refinement. E ve ntua lly the laws can 
be utilized in the cosscznction of entities to display apsofk 
p rope rti es in the domain of interrot. Examples of this 

progTeation can be drawn from the natural narncro (snch as 
physics) and their a p pli ca t i o n disnpHnro (such as mechanical 
engineering! 



, if e 

to be 




(concepesaUy) of two 
The first data s trat a 



cion of a 



The fixst 


proems asps any fiiliis is w hich ths i s uffi g rui entity cn- 
conatea ism the p w b i un apace for pnrpo ro s of cla ss ification 


The 


solution mrfmiqm qpeos far the purpose of wlecting the 


the peubtem. Ths third procero m e negro ths i ns rantistion and 
execution of the indka ne d met h ods to p rovide a station. 


At the ttirfr burl qtxeme. the problem type and solutio n 
technique d aronption s woold be very specific. while at the 
law-feaed iiTnroa they would be very grroral However; 
the |i i oc «— at cither ex t reme perform the same tasks, even 
though they ought be quite different in o p erational detail. 
In any cm* this model that the intelligence of an 

entity retidro in being able to categoeia prob l ems (the fiat 
pro c ero and the fiat data structure! select the ap prop ria te 

structure! and produce a rotation (the t^nd proms) Fur- 
thermore. the m od el p rovide s far various foam of kerning 
by ixptnri aa. refinement. and a b s tr ac ti o n of ths dats struc- 
tures as well m improvement of the m a p p ing pro c eroen . 

The object of raising this argument is S point oat that 
there seems s be an asymmeoic di s tributio n of work 


Some researchers subscribe to the belief that inquiries into 
the nature of intelligence still lead to the d i scovery of a 
(small) set of accurate, general lavs of intelligence. They 
believe that such la w s, sufficiently refined and understood, 
might enable the co ns tru ction of entities which display inael 
ligence and urn h umans as an exa sa ea c e proof that such en- 
tities axe pn roi b l e . Other romarchces axe not comfortable with 
the sumption that a sec of laws underlies intelligent be- 
havior in a directly analogous fashion to the physical laws 
which form the foundation of physka. They argue ramrod 
that intelligence is inherently heard an Warning and wing a 
Urge and drvear bag of highly specialized tracks. Construct- 
ing entities which dnplay xuaeUigenro can be acaoroplnhed by 
d isc ove r ing a few of these trick* recoding them in a com- 
puter program, and using them to solve problem* Intel- 
ligence can be incrementally im p rove d by adding mure trick* 


r e ported in the li treater s an planning and scheduling in ths 
enwtger of the suggested of intelligent run tie* These 

has been much work armed at synthroensg powerful general 
purpose planning mocha mama. and some work do n e to bedd 
special purpose pfestmm and srhndulrrs which solve specific 
problem* Thro work c orresponds to the aaooad data str u ctu re 
and the third p roce ro in the model, nepectmly. As ought 
be expected f nro the model, the specific pfemurrs are very 
often speci a l ratio n s of the general plannrn. Unfortunately, 
it is srldnro (if ever) the case that a clear statement of the 
characrmstica of the pro btems far which any of the planners 
or schedulers hav« baen designed is explicitly offered. This is 
a reflection of the fact that little if any work has been 
carried out con ce r ni ng the vocabulary of p robl em type* Ob- 
viously. without the vocabulary, rt is also very difficult to 
sake p ro gr am an the procrosro of rl— fymg p robl em types 
hoed un problem feetutro or srlecting soluoau methods based 






vod reantias on tte dmkfBKnt of a pro ble m type 
vocabulary a protect aa teQj|nC entity. 

tfffiitli of »f— * M on a a pa rulativ* n ote of in- 
nOfat enrirto* tkn an cater Jaarifrationa far the e x pl ore- 
tion of fates type aad alette w riiaipt d escripti on 
axabeluis There ia taaaos a bdkvt that any pw tea 
atver* natural or artificial. hsaaaa or rob otic , benefits from 
bring able ao cteify frattea which it fscoantets aad to 
cnlna ahtte appr oa cte a to thoae problems Aa outsoad- 
tag mifh of the power which cam be (noted by ex- 
plicitly snalyiiiig poten aad rotation spaces can be foaad 
ia two of the aoo important cooc e pe e of co mpu ter s rirune - 
e nraput e brtity aad cnmplriity . The fixst aada oa ia the 
categor izati on of p e o h kma sam thorn which axe ia principle 
nampntihlt aad thorn which ait ace compuabt* The a e c on d 
supplies the metric far d e ri d fog whether a rotation approach 
to a c o mpa tible pro ble m will be relatively efficient 
(polynomial ran rime} or relatively inefficient (ex p on enti a l 
run timei 

It is s ug ges ted that this type of analysis, and more, is in- 
dicated far the problem type and the rotation technique 
spacee ia aad tthartulmf. It would be useful for 

Teamrhen to have a problem description vocabulary though 
which they coaid dtonrea the aunilaritka aad differences 
among p robl ems . Furthermore, to have a map of the known 
po rti on s of the rotation approach space would help developers 
assess the results of gristing mroarch and would help focus 
new rese arch efforts aa the unknown regi ons. In addition, 
these two sp ace s axe neceamry for the conetxtacrioa of intel- 
ligent entities under the model d es crib e d earlier. 


X A Standard Approach 

Ia an effort to constru ct aa initial des crip tion of these 
spaces far planning ty p e s of problems, a partial literature 
review baa been attempted. Planning p r o bl ems are usually 
considered to take aa input scale des c r i ptions, action descrip- 
tions. and path constraino* and to produce as output some or- 
dered mt of states and actions which obey the path con- 
straints. One form of p robl em and solution categorization 
scheme eta be baaed aa the crating literature in which this 
input/output view of p**«mt>g a afcm , 

The state descriptions taken as input c o n si st of an initial 
state and a roal state. A problem categor iza tion rohe me an 
use as one measure the size aad complexity of the initial tad 
goal states given. The methodology used to encode the distin- 
guished initial and goal states ia also used by the pi*«n*r to 
construct tnal intermediate states during synthesis aad to 
partially describe the plan whkh the syst em outputs. It is 
often the case in robotic proh l rms that sate descrip ti ons con- 
cern. at same level of abstraction. objects, their p ropertie s, and 
their r el a tion s hip s to other objects A rotation categorization 
scheme could use as one measure the technique used by the 
planner to r epr es en t sates. 

The a cti o n descriptions taken as input for robotic problems 
i n cl ud e a catalog of agents aad facilities involved along with 
their cupehilitica in terms of «*• ind perception. 

One useful measure for probl em categorization is the number 
aad nature of the agents d e scri bed. In problems involving 
robots, the laws of physics must always be considered as sup- 
plying actions nach aa gravity which all agents must recog- 
nise tax which ao agent can control. Beyond "mother 
nature* there can be one agent or many agents. If there axe 
many agents, they must be defined ia terms of their reUtion- 
shipe (c o operat i ve, neutral, competitive), communications fnone. 
otrarirml. contzsuouaX and ahaxed information (none, partial, 
complete), id name tat a few of the relevant characteristics. 


Early phumera usually ronridered tingle agent p la nning timn- 
tions lu X 31 Mom recently, many itenfriirre have footed 
on mnltiagent planning problems aad the closely related am 
of dntiibu ta d planning k 5, ^ 7, k 9, IX 11* IX IX K # 

is. ix in 

The action descr ip tion metho d olo gy mad by the p l ann e r can 
alro be used in a ca te go rizati on retain* In lefete 

pro bl em* actions often change the p roperties of objects or the 
relations ™"**»g objects One useful manure for statin s 
« whether or not action d uration m explicitl y 
represented. The eaxty planners did not explicitly mender 
duration [l. X 3l but mote recently the representation and 
am of — «*"n dura tfcw te been a fertile ana of research [11 
19. 2Q, 21 221 btemting qu csti o ni involve the urn of time 
in an absolute or relative way and the representation of time 
as points or intervale. 

The path constraints taken as input are used to define the 
general type of rotation path desired aad often have the 
fan of remri ctio te on properties of the initial and goal 
sate* Boar categories are easily di s tinguish ed as being awful 
in problem claasificatio* Problems with p rop erti es whkh axe 
pream t in the initial earn and whkh must also be pream t in 
the goal sate are preservation problems. Complementary 

prevention problems have properties which are absent in the 
initia l sate and which must also be absent in the goal stale. 
Problems with p ropertie s which are present in the initial 
state but must be afaaent in the goal mats are demractioa 
problem* The oampkmcntary construction problem* whkh 
have been moat studied in the literature, have pro p errirs 
whkh are afaaent in the initial sate tat must be present in 
the goal sate. 

Further measurements for the construction of the solution 
technique space out be found by cocted ering the way in 
which the planner goes about synthesizing ia output [23J. 
Early planner* used weak methods such aa meem/cads 
analysis and simple back-chaining to rolve rather ample 
problem* As researchers began to attempt more aad some 
complex problem* more powerful techniques were used. The 
idea of hierarchical atatraction was used to focus the plaonrr 
an the hardest puts of the problem first [24. 251. Next to 
be considered were conjunct ordering, subgoal interaction, sad 
the nonlinear nature of plana [26. 27. 28. 29, 30, 31, 32. 33, 
34 J. Later, pre-f armed plan segments were built ism plan- 

ners to provide item with a mechanism of evokfiag 
repeatedly planning typical tasks from scratch [35. 36l 
Finally, planner* have been given mea-kvel control ro that 
they can plan about planning aa well as planning aboat mir- 
ing the problem at hand [37, 38l Related, but alternative 
view* of planning also have been taken. Opposed to the 
view that the planner should strive to make perfect plans 
exclusively, one planner p ro p osed making an imperfect plan 
tad debugging it [39l Another planner took the view that 
an attempt should first be made to d isprov e that goals can be 
reached before attempting to reach thoae goals [401. 

Pursuit of this type of "standard" approach will probably 
produce a useful rlteifiritino scheme far planning proble m 
types and solution technique* Much more work is required, 
and even then it remains to be seen whether practically aw- 
ful r cpr c atn tttiopa or data structures can be constructed in 
the context of the intelligent entity model whkh reimntard 
this investigation. It is suggested that a similar effort on 
scheduling problem* that to one based on input/ousput 
properties and on rep reaen ation/ reasoning characteristic* would 
yield nmilar result* Unfortunately, this given a set cf 
structures for planning and another set of structures far 
scheduling, either avoiding the question of the difference be- 
tween planning and scheduling altogether, or answering that 
there to a difference based on the narrow input/ output defini- 
tions used to build the structure* Given that the terms 
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"punning* and "scheduling” cover a very broad set of 
problems, such a narrow answer U not very satisfying. It 
seems intuitively obvious that a workable classification 
scheme should differentiate a set of problems which are 
purely panning problems, a set which are dearly s chedulin g 
problems, and a set which have characteristics of both, or it 
should show that there is a spectrum of problems nans of 
which are purely planning problems oar purely s chedulin g 
problems. 

It a suggested that two other observations about planning 
and scheduling problems can help to rectify this sense of dis- 
satisfaction. One c onc erns the nature of the uncertainties 
which are encountered in robotic tasks. The other c on cer n s 
ways in which to manage the uncertainty. Taken together, 
these two ideas provide a new perspective on the difference 
between pUnning and scheduling. 

4. A New Perspective - Observation 1 

An interesting exercise is to consider what ia known in the 
earliest stages of a pUn/achedule/execute cycle and to contrast 
this with what is known at the completion of such a cycle. 
In the early stages, not much ia known. In robotic tasks, it 
may be as little as an abstract goal (the need to repair a 
broken subsystem on a space station), some idea of the 
resources which might be called into pUy (any tools and 
spares on or available to the station), the objects which may 
be involved (the tools and all parts in and around the sub* 
system with the unknown faultX and the manipulatory and 
perceptual capabilities of the robotCs) in the area as well as 
its currently assigned tasks. In the final stages, there is a 
plethora of information. Every important intermediate state 
can be known to the resolution of the system sensors includ- 
ing the specific objects involved, the values of all of their 
relevant properties, and the details of all relationships be- 
tween objects. Every important action can be known within 
sensor resolution including start times, durations, end times, 
facilities utilized, even robot spatial and force trajectories. 
Uncertainty remains, to be sure, but hindsight is greatly supe- 
rior to foresight in robotic planning, scheduling, and execution. 

From this perspective, the important question ia what infor- 
mation ia required to resolve various uncertainties and when 
can this information be known with sufficient accuracy that 
commitments can be made. Planning, scheduling, and execu- 
tion can be viewed aa a con t in u ou s aeries of refinements in 
which the initial abstract concept is converted into the final 
concrete realization. An interesting characteristic of robotic 
tasks is that they are executed in the real world which ia 
the domain of Murphy's laws as well aa Newton's laws. 
Something will probably go wrong, but it cannot be deter- 
mined in advance what it will be or when it will happen. 
Furthermore, actions executed in the real world are not easily 
retracted. In other words, the penalty for commitment made 
too early will be extra expense, because, once executed, the 
wrong action will require effort to reverse, if it ia reversible 
at alL 

The uncertainty which is encountered during planning, 
scheduling, and execution has many sources. A major source 
stems from the fact that the robot (and any supporting com- 
putational device) holds an incomplete, inexact digital model 
of an analog world. The incompleteness is due to the in- 
herent fimteneas of the computer relatively to the practically 
infinite richness of detail in the world. The inexactness is 

due to the imperfect nature of the input and output channels 
which the robot (sad any supporting computational device) 
has to its worldly environment. Sensors aa input channels 
and mechanisms as output channels are only accurate within 
bounds. Any form of intelligent entity dealing with robot 
tasks must recognize and deal with them facta even though it 


Is helpless to change them. This uncertainty b reflected ia 
the structure of representations of state* and actions in sys- 
tems which must deal with the real world. State descrip- 
tions include on the values asaxaated with obja* 

properties. Action descriptions i nclude accuracies on the 
values achievable by their application. Recently, me ch a ni s m s 
have been reported which can be used to plan about nach 
uncertainty [4ll 

Since all robotic systems must handle uncertainty at the 
modeling, sensor, and m ec h a n ism level, this type of uncer- 
tainty is obviously not of much use for clsaeificetioo (other 
than realizing that the level of uncertainty in the system 
must be to the level of uncertainty in the task). 

Another source of uncertainty which occurs at a higher level 
ia useful, however. States and sate des c riptions involve a 
degree of uncertainty. 1ft micro scale, it is not always clear 
when a particular object which ia required will be available, 
or. if it is available at some point, whither it will Mill be 
suitable when needed. In macro-male, it is certainly possible 
to imagine that the specifications of initial and goal states 
can change with time, similar comments can be mads about 
actions. At the micro-scale, the availability of facilities 
changes with time, sometime unexpectedly aa a higher priority 
task encounters difficulties or the facility itself requires ad- 
justment or repair. At the macro-scale, it ia even pom h ie to 
imagine that the capability of a facility changes or that 
facilities are added to or deleted from the pool of reeouxms. 
These examples each point out that everything in the en- 
vironment (except the law* of phymea) is anceptihle to 
change over time, aometimea gradually, snmstim ss dramati- 
cally. Of fundamental concern is the uncertainty about the 
availability and suitability of agents, objects, and facilities, 
and when and to what degree this uncertainty can be 
resolved. 

It is certainly well understood in the robotics community 
that plans and schedules for robots are fragile in terms of 
executional integrity [42, 43]. The earliest pl a n n ing symems 
included execution monitors [4A\ and more recent work has 
been aimed at knowledge-based systems for error recovery [45, 
46]. Furthermore, methods have been p rop osed for incom- 
pletely specified or dynamic environment* [47, 48, 49l 

However, it is not clear that an uncertainty model is a 
popular one for the classification of pl ann i n g and s ched ulin g 
problems. None the less, it is proposed that it is precisely 
the view of planning, scheduling, and execution as a step by 
step resolution of uncertainty which begins to formaline the 
way in which we intuitively view the difference between 
planning and scheduling problems. 

5. A New Perspective - Observation 2 

The second observation complements this way of looking at 
problems by providing a perspective for looking at solution 
techniq ues to manage the uncertainties involved in robotic 
tMkt The key here is the concept of least commitment. In 
the face of uncertainty, various options can be t ake n . At 
one extreme, s guess can be made. If the guest can be m ade 
in such a way that it can be easily retracted if it is wrong, 
then this is a useful strategy [37]. At the other extreme, the 
decision can be deferred until enough information can be 
gathered so that the uncertainty can be absolutely resolved. 
Of course, it is seldom the case in robotic tasks that either of 
these extremes is obtained. It is not often that so little in- 
formation ia available that a blind guess most be m a rie , or 
that so much information is available that no doubt remains. 
It is left to the implementors of planners and schedulers to 
provide sufficient knowledge to their systems to be able to 
decide what degree of commitment is warranted by a given 
level of information. The point is that it is a difficult deci- 
sion. 
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Another point to that it to the ordering of actions, the as- 
signment of faculties to carry out actions, and the aoocUtion 
of starting and stopping times with actions which are being 
deferred. Them ideas are also not new to the AI or robotics 
communities, but have been used in different ways than to 
classify planning end scheduling solution technique s. As has 
been pointed out, one of the main jobs of a planning system 
to to order states and actions, so all systems have a basic or- 
dering capability [23). In addition, the concept of abstraction 
contains the ides of ordering tasks by importance to provide 
a focus for the reasoning system [23). Furthermore, advanced 
planners are baaed on the principle of least commitment in 
the ordering of plan fragments to avoid adverse mbgoal inter- 
actions [23. 30, 311 But in keeping with the spirit of this 
document, it has been suggested only a few times that the 
output of a planner should at times be limite d to a partially 
ordered set of actions, the final order to be resolved by the 
execution environment [42, 50). In the extreme, system* have 
been implemented that plan only incrementally, either baaed 
on the immediate need to do something other than an obvious 
atomic action [5ll or on the opportunity to easily achieve one 
of a list of goals [52l both driven by the environment 

6. la There a Difference 7 

On one hand, planning and scheduling problem* need to be 
classified by the form and content of the input accepted and 
the output delivered. Planning and schedulin g tec hn i q ues 
need to be classified by the representation and reasoning 
mechaniama which transform the inputs into the outputs. On 
the other hand, planning and scheduling types of problems 
can be clarified by the time at which sufficient information 
is av ailab le to define the *««**•! and goal states, the facilities 
catalog, the identity and relationship among agents, the desired 
actions, the ordering of the actions, the assignment of agents 
sad facilities to the actions, and the association of start and 
stop times to the actions. Planning and scheduling solution 
techniques can be classified by the manner in which each of 
these decisions is delayed until sufficient informati o n is avail- 
able to justify commitment. 

The planner is mainly concerned with what should be done 
and is invo. ed with using the initial and goal states along 
with the capabilities of the facilities in the catalog to select 
the desired actions. The scheduler is mainly interested in 
when the desired actions should be accomplished and uses the 
availabilities of the facilities in the catalog to aaaign facilities 
and times to actions. Sometimes the planner must order the 
actions based on precondition and postcondition arguments, and 
sometimes the scheduler must order actions based on environ- 
mental conditions. Sometimes the environment is very stable 
and enough information is available that the planner and 
scheduler can work together to make moat decisions prior to 
the start of execution. Other times the environment is very 
volatile and so little information is available that the planner 
and the scheduler must delay almost every decision until an 
instant before execution. 

Confusion between planning and scheduling often occurs be- 
cause the planner over-comxmta, taking on the task of order- 
ing actions when there is no justification to prefer one order 
over another, assigning facilities before the details of 
availability can be known, and even aanciating start and stop 
times before environmental conditions warrant such decisions. 
Such overcommitment by the planner prematurely removes all 
decision pro ces s es from the scheduler and makes the execution 
environment unnecessarily susceptible to failures. It to the 
responsibility of the planner to know when it is absolutely 
necessary to make these decisions because of unavoidable re- 
quirements based on physical laws or facility capabilities, and 
when it is possible, or even desirable, to leave these de ci s ion s 
for the scheduler. 


To underscore these concepts, it should be noted that three 
different research groups have beoome interested (apparently 
independently and simultaneously) in what can be considered 
t paradigmatic problem in the division of labor between n 
"planner* and a "scheduler". The do m ain to robotic amembty. 
especially in the cases where the aaemhly can he ac- 
complished by a number of different pathways. A p a rtic u lar 
problem might occur when a robot to amembUng from a com- 
plete but random stack of parts in an environment in which 
all the required facilities are always available. The pl a nn e r 
can commit to one particular assembly sequ enc e, uauping the 
job of the scheduler, but only at the cost of requiring the 
execution rystem to always reorder the parts so that they m 
available in a prescribed sequence. If instead the planner to 
willing to supply the required set of actions to the scheduler 
with only the minimum set of ordering constraints, the 
scheduler can efficiently reset to ths ordering uncertainty in 
the availability of parts to guarantee efficient exe cu tion . 

Notice that the same tomes arise if all the parts are avail- 
able all of the time (for example, bins of each individual 
part are available in the work ceil), but the specialized tools 
to perform the aaemhly am only available at random Tina s 
(for example, because they am used in a number of cells). 
An even more complex case to encountered if availability of 
both the parts and the tools to randomized. In all of these 
circumstance*, it to clear that the planner should produce a 
minimally ordered plan so that the scheduler can capitalize 
on the uncertainty in the environment rather than being 
hampered by it. 

The groups involved have been interested in the uembiy 
problem based on its own merit and have not been overly 
concerned with the impact of their work on the categoriza- 
tion of planning and scheduling problem types or solution 
techniq ues. They have been more interested in ths qu e s ti o n 
of devising the best way of representing sc h edu l i n g option* 
gained from the planner and reasoning over these options 
during executions. The representations used have included sets 
of partial orders [53, 54, 55, 56, 57l AND/OR graphs [5S 1 
and Petri nets [59. 60). It to proposed that the robot as- 
sembly problem as formulated by these researchers is an out- 
standing example of a problem which clearly d i stin gui sh e s be- 
tween planning and scheduling, and which dearly indicates 
the penalties incurred for ignoring the dis t inc ti ons. It 
deserves further study in the effort to develop classification 
schemes for planning and scheduling problems. It is further 
proposed that the three methods which the researchers have 
developed deserve comparative study in the effort to develop 
classification schemes for solution techniques for planning and 
scheduling problems. 


7. Conclusions 

It is concluded that there is a distinct difference between 
planning and scheduling. Until an adequate classification 
scheme can be developed for planning and sc h eduli n g problem 
types, the case of robotic asambiy under various type* of un- 
certainty about availability is offered as a paradigmatic ex- 
ample. It to hoped that this example will stimulate work on 
the classification of problem type* as well as solution tech- 
niques. 
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The properties of a temporal language are determined by its 
constituent elements: the temporal objects which it can represent* the 
attributes of those objects, the relationships between them, the 
axioms which define the default relationships, and the rules which 
define the statements that can be formulated. The methods of inference 
which can be applied to a temporal language are derived in part from a 
small number of axioms which define the meaning of equality and order 
and how those relationships can be propagated. More complex inferences 
involve detailed analysis of the stated relationships. Perhaps the 
most challenging area of temporal Inference is reasoning over 
disjunctive temporal constraints. Simple forms of disjunction do not 
sufficiently increase the expressive power of a language while 
unrestricted use of disjunction makes the analysis HP -hard. In many 
cases a set of disjunctive constraints can be converted to disjunctive 
normal form and familiar methods of inference can be applied to the 
conjunctive sub-expressions. This process itself is HP-hard but it is 
made more tractable by careful expansion of a tree- structured search 
space. 


1 . Introduction 

An intelligent autonomous system operating in a remote, unstructured 
environment must have three capabilities. First, it must be able to create a 
plan or course of action according to an initial state of the world, a goal 
state of the world, and some knowledge of its own abilities. Second, it must be 
able to determine a sequence of actions, according to the constraints on the 
steps of the plan and the evolving state of the world. Finally, it must be able 
to produce the desired effect of those actions according to its abilities and 
the present state of the world. 

The performance of the planner, the sequencer, and the executor components 
of such a system can be very much affected by the language used to represent 
plans. The concepts of action must be suitable for the planner, which must 
reason about goals and effects, but at the same time be tractable for the 
executor, which must produce the desired effects. The concepts of order must be 
sufficient for the planner, which must control undesired interactions between 
operations, but at the same time they must not Impose unnecessary constraint on 
the sequencer, which must adapt the sequence of actions to the dynamically 
changing state of the world. The methods of formulation must enable the planner 
to produce the most general plans possible, yet at the same time it must be 
feasible for the sequencer to derive a sequence of actions from those plans. The 
language must be terse. The sire of the plan must be proportional only to its 
complexity. 


•Research conducted under. the McDonnell Douglas Independent Research and 
Development Program. 
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It. hex bten reputedly souNted that reuonlsi over time is an usentlal 
eleMnt of planning and specific t uy oral represent at Iona hare boon pr o po—d to 
facilitate tha planning process. including a linear progra—lng nodal of Malik 
and Binford[l], tha space-time naps of Hlller[2] , tha interval algebra of 
Allea[3]. tha point algebra of Vllain and Ksutx[4]» and tha end-point represen- 
tation of Chees eaa n(S] . Although not concerned with planning but instead with 
tha problens of sequencing tha activities of robots. For and Kenpf[tf] propose a 
language of teaporal constraints as the target representation for planners. 

With this abundance of teaporal languages for planning and sequencing, it 
is important to establish the properties of the proposed languages and to 
understand the inference methods which can be applied to them. Although a 
complete survey of teaporal reasoning is beyond the scope of this paper, an 
examination of the basic elements of teaporal representation and the methods of 
temporal inference mill establish the primary criteria for comparing these 
languages. 

2. Elements of Temporal Representation 

The properties of s language are embodied in its syntactic form and its 
semantic interpretation. The concern here is with the semantic elements of a 
language rather than with its syntactic details. Nevertheless, in order to 
discuss the variety of possible temporal languages. < - is necessary to introduce 
some simple syntactic structures which represent abstract semantic entities. 

Temporal languages are concerned primarily with temporal objects: instants 
and Intervals of time. Some authors maintain that instants of time present some 
semantic difficulties and therefore propose that time intervals should be the 
primitive element of temporal reasoning[3] . Others maintain that intervals of 
time can be defined by their endpoints and propose that instants should be 
treated as the basic element of temporal reasoning. Some sequencing problems 
Involve activities, which in reality occur over some interval of time, but for 
purposes of analysis can be treated as atomic and indivisible, la the following 
discussion, instants of time will be treated as primitive objects, denoted by 
alphanumeric symbols such as X. T, and mime-o'clock. Likewise, intervals will 
be denoted by alphanumeric symbols, such as Z, V, Install-clip, and drill-bole, 
but when useful or necessary, the initial and final endpoints of an interval 
will be denoted by a suffix letter 1 or f attached to the interval name, such 
as Z1 and Zf. 

Teaporal languages are concerned with the attributes of teaporal objects. 
Some languages may allow the specification of the absolute time of some instant 
or it may be possible to specify the duration of an interval. Specialized 
systems may associate the properties of physical processes with intervals, such 
as rates, loads, or volumes. Planning systems may associate propositional 
variables and their values with temporal objects. Ultimately, each temporal 
object is associated with some event, activity, or proposition. For instance, it 
is possible to refer to the instant which begins an occultation, or the interval 
of time when the action install-clip is performed, or the Interval of time over 
which the proposition channel- is-available is true. 

Temporal languages are concerned with the relationships between teaporal 
objects. The most primitive Involve the relationships between instants of time. 
Two instants may be equal, denoted by the operator =. they may be inequal, 
denoted by the operator <>, or they may be ordered, as denoted by the operator 
<. In order to avoid any syntactic ambiguity, such relationships are written in 
fully parenthesized infix notation, as in the expression (X < Y). The relation- 
ships between two intervals of time, as defined by Allen, are shown schemati- 
cally in Figure 1. The relationships between instants and intervals of time can 
be defined in a similar fashion. All of these relationships can be specified by 
their respective endpoint relationships as indicated in the right hand column of 
Figure 1. 

In addition to the facilities for explicitly stating the relationships 
between temporal objects, a temporal language must include some axioms which 
define the relationships between objects that are not otherwise constrained. 
Commonly, it is assumed that, in the absence -of other explicit constraints, two 
instants of time, X and Y, are ordered as either (X < Y) or (Y < X), or they are 
equal, (X = Y) . Likewise, unless otherwise constrained, two intervals can be 
related in any of the 13 possible ways shown in Figure 1. In some applications 
involving the serial execution of a set of operations, there is no opportunity 
for any of the operations to be done concurrently. In such cases an axiom which 
defines the default relationship between intervals states that, unless otherwise 
constrained, two intervals, X and Y are disjoint and ordered as either 
(X before Y) or (Y before X). Such axioms play a significant role in the 
treatment of negation and the processes of inference. 
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Temporal languages are subject to certain rules of formulation. The 
simplest rule is to assume that a given set of primitive constraints is to be 
treated as a conjunction and that they must all be satisfied simultaneously. In 
contrast, the language defined by Allen allows a restricted form of disjunction. 
The relationship between a given pair of intervals can be specified as a 
disjunction of any of the 13 possible primitive relationships. This makes it 
possible to circumscribe indefinite relationships or to prescribe s ome relation- 
ships which cannot be expressed as one of the 13. For instance, suppose that two 
intervals, Z and Y, must begin at the same time but that there is no constraint 
on their termination. It would artificially constrain the intervals to state 
that (X begins Y) because this primitive relationship requires that X terminate 
before Y. Likewise It would be an artificial constraint to require that 
(Y begins X). Using this vocabulary of 13 primitive relationships, the 
relationship between X and Y can only be stated as a disjunction, ((X begins Y) 
or (Y begins X)). In Allens *s language, disjunction is restricted to phrases 
that define the relationship between a single pair of intervals and cannot be 
used to pose constraints such as, ((X before Y) or (Z before ff)). 

The properties of a temporal language are determined by the combination of 
these elements: the objects, attributes, relationships, default axioms, and 
rules of formulation. Together these elements determine the set of problems that 
can be represented. For instance, the language of strict partial orders is 
composed of symbols which denote Instants of time, the primitive ordering 
relationship <• the default axiom that states for all X and Y either (X < Y) or 
(Y < X), and a rule of formulation that allows only conjunctions of primitive 
ordering constraints. A given constraint expression in this language defines a 
set of admissible total orderings over a set of instants of time. For example. 
The conjunction ((X < Z) and (Y < Z)) defines 2 admissible orderings of X, Y, 
and Z: [X,Y,Z] and [Y,X,Z]. The limited rule of formulation in the language of 
strict partial orders makes it impossible to state the constraints for a problem 
which admits the 4 linear orderings [X,Y,Z]. [Y.X.ZJ. [Y.Z.X], and (Z,Y,XJ. 

There is no conjunction of primitive ordering constraints which defines exactly 
this set of linear orderings! The limited forms of disjunction included in 
Allen's interval algebra or the point algebra defined by Vilain and Kautr 
encompass some sense of indefiniteness in the relationship between temporal 
objects but these forms of disjunction are not sufficient to represent the full 
range of possible ordering problems. 

A number of common temporal representations can be quickly distinguished by 
their constituent elements. For instance, the language of equivalence classes is 
composed of symbols which denote atomic temporal objects, the equality and 
inequality relationships , = and <>, an axiom which states that for all X and Y, 
(X = Y) or (X <> Y), and a rule of formulation which allows only conjunctions of 
equality constraints. In contrast, the language of graph coloring problems has a 
similar structure but the rule of formulation allows only conjunctions of 
inequality constraints. The language of temporal constraints proposed by Fox and 
Kempf[6] is composed of symbols which denote atomic temporal objects, the 
ordering relation- ship <, an axiom of serial processes which states that for 
all X and Y, (X < Y) or (Y < X) , and a rule of formulation which allows 
arbitrary use of conjunction, disjunction, and negation. This axiom limits the 
scope of this language to problems that involve activities that must be done one 
at a time, such as a robot performing an assembly task. However, the 
unrestricted use of disjunction guarantees that this language can represent any 
problem within that domain. Portrait, a temporal language under development by 
Fox and Green allows arbitrary use of equality, ordering, conjunction, 
disjunction, and negation. 

3. Methods of Temporal Inference 

Temporal reasoning is a process of deriving the properties of temporal 
objects and the relationships between temporal objects that are implied but may 
not be explicitly stated in a given set of temporal constraints. The most 
familiar form of temporal reasoning is constraint propogation. In the language 
of equivalence classes constraint propogation is based upon two axioms. The 
first defines the symmetry of equivalence: for all X and Y, (X - Y) implies that 
(Y = X) . The second defines the method for propogating equivalence: for all X, 

Y, and Z, (X ~ Y) and (Y = Z) implies that (X - Z). There is no symmetry in the 
language of strict partial orders, only an axiom which defines the method for 
propagating order: for all X, Y, and Z, (X < Y) and (Y < Z) implies that 
(X < Z). The language of partially ordered sets includes an axiom which defines 
how a disjunction of order and equality can be propogated: for all X, Y, and Z, 
(X <= Y) and (Y <= Z) implies that (X <= Z). Coupled with this is an axiom which 
defines how constraints over a given pair of temporal objects can be resolved: 
for all X and Y, (X <= Y) and (Y <= X) implies that (X = Y) . If, after the 
complete propogation of constraints, only one of the constraints (X <= Y) or 
(Y <= X) has been imposed then it can be assumed that the two objects are not 
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equal. Vilain and Kauts define a language over instants of tine which, for a 
given pair of instants, allows an arbitrary disjunction of the 3 possible 
relationships between. that pair. In this context, the propogatlon of constraints 
can be best defined bp a matrix as shown in Figure 2. Conjunctions of 
constraints over a single pair of instants can be resolved by a rule of 
Intersection as shown in the matrix of Figure 4. Vilain has demonstrated that 
constraint propogation within this language is both complete and correct. 

Allen’s Interval algebra relies upon similar, tabular rules of Inference, but 
because of the added complexity of this language, constraint propagation is not 
guaranteed to be complete. 

In most circumstances these constraint propagation axioms can be applied in 
reverse in order to identify the essential constraints in a problem and to 
eliminate any implied constraints. Given the complete set of implied and 
essential constraints it is a simple matter to identify the equivalence class of 
some temporal object along with all of its predecessors, direct predecessors , 
siblings, successors, and direct successors. For instance, the axiom which 
defines the predecessors of a temporal object Z states that X is a predecessor 
of Z if (X < Z). The direct predecessors of Z include all those temporal objects 
X such that (X < Z) but there does not exist T such that (X < Y) and (Y < Z). 
These can be easily identified by scanning the set of essential constraints. 

Reasoning about the admissible ordering of temporal objects is directly 
related to an analysis of precedessors and successors. For instance, in the 
language of strict partial orders, the controlling axiom specifies that a 
temporal object X can occur only after all of the predecessors of X. Of course, 
those objects which have no predecessors can occur at any time. This axiom can 
be used to incrementally build sequences of activities. At each step of the 
process simply choose one of those activities which can occur next. 

In most sequencing problems the combined ordering constraints limit the 
admissible sequences of activities but do not remove every sequencing option. In 
most problems there are many admissible sequences. The number of admissible 
sequences can serve as a useful indicator of the available sequencing options. 

In some problems this may provide an estimate of the effort required to find the 
best sequence of activities. In other problems it may provide an estimate of the 
inherent flexibility that can be exploited in sequencing those activities. The 
naive approach to computing this number would be to explicitly enumerate all of 
the feasible sequences by exhaustive application of the sequencing axiom or 
other more sophisticated algorithms [7 ] . Unfortunately, the simplest of problems 
will prove the most intractable. Consider a serial task of 15 steps with no 
sequencing constraints. There exists 15! = 1,307,674,368,000 sequences. Even if 
one sequence could be generated each microsecond it would still requie 15 days 
to enumerate the entire set. Fortunately, general methods are available which 
can determine the number of feasible sequences over a strict partial order 
without explicit enumeration. These methods are first reported in a textbook by 
Wells [8] but several refinements of these methods were developed at HDRL by the 
authors. Generally, this computation can be accomplished by recursive 
application of 3 simple rules: 

( 1 ) if a set of activities can be divided into two subsets such that 
all of the activities in the first set must precede all of the 
activities in the second set, then the total number of feasible 
sequences equals the number of feasible sequences for performing the 
activities in the first set times the number of feasible sequences for 
performing the activities in the second set. 

(2) if a set of activities can be divided into two subsets such that 
all of the activities in the first set can be performed independently 
of the acitvities in the second set, then the total number of feasible 
sequences equals the total number of feasible sequences for performing 
the activites in the first set times the number of feasible sequences 
for performing the activities in the second set tiroes the number of 
ways that one sequence from the first set can be interleaved with one 
sequence from the second set. 

3) if a set of activites cannot be divided into two subsets according 
to rules (I) or (2) then that set of activites can be partitioned into 
two strategies for performing those activies which have no feasible 
sequences in common, and the the total number of feasible sequences 
will be the number of feasible sequences under the first strategy plus 
the number of feasible sequences under the second strategy. The 
partition is generated by identify a pair of unconstrained activities, 

X and Y. The first strategy is defined by the orginal set of 
constraints plus the constraint that X must precede Y, (X < v ), and 
the second strategy adds the constraint that Y must precede X, 
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(T < X). (Repeated application of these 3 rules is guaranteed to work 
regardless of the X and T chosen when using rule 3 f but the number of 
partitions generated is significantly affected by the choice. By 
carefully selecting the steps X and Y, it is possible .to control the 
number of partitions ultimately generated.) 

By recursive application of these rules it is possible to determine the 
number of feasible sequences for performing a set of activiites from start to 
finish, or it can be used to determine the number of ways of completing the task 
from any given state. In most circumstances the number of feasible sequences 
corresponds closely to the degree of flexibility inherent in the sequencing of 
the activites and it can be used as a valuable metric for comparing different 
plans or strategies. As a side-effect, application of the 3 rules stated above 
results in the decomposition of a given task into sets of dependent activities, 
sets of independent activities, and into disjoint sub- strategies. This 
decomposition can be used by human analysts to better understand the structure 
of the tasks that they must plan and coordinate. 

Onf ortunately , inference over a disjunctive language, such as that 
developed by Fox and Kempf, is much more difficult. One way of resolving the 
constraints in a disjunctive constraint expression is to convert a given set 
constraints into disjunctive normal form, i.e. a disjunction of conjunctions of 
the primitive ordering constraints, keeping only the satisfiable and non- 
redundant subexpressions. In that form, the methods of inference sketched above 
can be applied separately to each conjunction of constraints and the results 
combined under an appropriate interpretation of disjunction. The production of 
this reduced disjunctive normal form is very difficult, in fact it is NP-hard, 
but it is an essential part of more general temporal reasoning 

For instance, the constraint expression shown in Figure 4 is typical of the 
constraints imposed on small assembly problems. Production of the disjunction 
normal form of that constraint expression, using the distributive law of boolean 
algebra, (X and (Y or Z)) — > ((X and Y) or (X and Z)), results in a set of 1024 
conjunctions. In general, the sise of the disjunctive normal form grows 
exponentially with the number of applications of the distributive law. Some of 
the resulting conjunctions are inconsistent and should never be considered, 
others are specific cases of more general sub-expressions in the result and can 
safely be removed. Other simple methods for producing the disjunctive normal 
form have the same result. However, all of the admissible sequences for 
performing the task defined by these constraints are embodied in only 22 
conjunctions. 

An efficient method for deriving that set of 22 conjunctions is closely 
related to methods for determining the satisfiability of boolean expression and 
is based on the expansion of a tree structured search space. Each node in the 
search space consists of 2 parts. The first is a partially formed conjunction, 
and the second is a constraint expression which remains to be satisfied. The 
root node consists of an empty conjunction coupled with the initial constraint 
expression. Successor nodes are formed by propagating primitive constraints from 
the constraint expression into the conjunction being constructed. The target 
leaf nodes consist of a completed conjunction which satisfies the original 
constraint expression and an empty set of constraints remaining to be satisfied. 
Specific heuristics have been developed which make it possible to prune 
redundant or inconsistent solutions early in the tree expansion. Using these 
methods the constraint expression shown in Figure 4 produced 28 consistent 
conjunctions, 6 of which were subsequently identifed as redundant. Subtree 
expansion was terminated 58 times because inconsistencies were detected and 12 
times because redundancies were detected. This is considerably more efficient 
than producing 1024 conjunctions and then attempting to prune the 
inconconsistent and redundant sub-expressions . 
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4. Conclusion 

The properties of a temporal language are determined by its constituent 
elements: the temporal objects which it can represent, the attributes of those 
objects, the relationships between those objects, the axioms which define the 
default relationships, and the rules which define the statements that can be 
formulated. The methods of inference which can be applied to a temporal language 
are derived in part from a small number of axioms which define the meaning of 
equality and order and how those relationships can be propagated. Hore complex 
Inferences involve detailed analysis of the stated relationships. Perhaps the 
most challenging area of temporal inference is reasoning over disjunctive 
temporal constraints. Simple forms of disjunction do not sufficiently increase , 
the expressive power of a language while unrestricted use of disjunction makes 
the analysis NP-hard. In many cases a set of disjunctive constraints can be 
converted to disjunctive normal form and familiar methods of inference can be 
applied to the conjunctive sub -express ions. This process itself is NP-hard but 
it is made more tractable by careful expansion of a tree- structured search 
space . 
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Thirteen possible interval relationships. 
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Matrix of constraint propagation in the point algebra. 
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Matrix of constraint resolution in the point algebra. 
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Typical disjunctive constraints on the steps of an assembly problem. 
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1. Abstract 



Most current A! planners build partially ordered plan structures which delay decisions on action ordering. Surch / 

structures cannot easily represent contingent actions. TOa-^apec^zrescntt ^ representation which can^The s f 

representation has some other useful features: it provides a good account of the causal structure of a plan, can be / 
used to describe disjunctive actions, and it offers a planner the opportunity of even less commitment than the 
classical partial order on actions. The use of this representation is demonstrated in an on-board spacecraft activity 
sequencing problem. Contingent plan execution in a spacecraft context highlights the requirements for a fully 
disjunctive representation, since communication delays often prohibit extensive ground-based accounting for 
remotely sensed information and replanning on execution failure. 


2. Introduction. 


Plan generation isn’t problem solving. Planning problems are physical realities which require physical solutions. 
Planning can only be construed as problem solving when it’s part of a larger system which also addresses plan 
execution; only execution can realize the solution a plan specifies. We use this theme of plan execution to bring 
together some important issues in AI planning. We consider least commitment plan construction, the representation 
of teleological information, disjunctive plans, and contingent plan executior in realistically complex domains. 

We begin in the next section by briefly discussing the way that most AI planners operate. Commonly used 
techniques include least commitment action ordering and object selection; we discuss both. Following this, in 
section 4, we describe an actual planner called O-Plan [1] which uses these techniques to good effect We cover 
the essentials of O-PIan’s search for an acceptable plan, leaving aside low level details. This discussion is used to 
show how O-Plan relegates the responsibility for reasoning about disjunctive actions to its search space management 
component We argue that what a planner needs is a plan structure which is able to describe the disjunction of 
action implied by the choices encountered during plan construction. In section 5 we present a solution to the 
problem. A representation is given which has the properties we seek: it can be used to do least commitment plan 
construction; it explicitly represents teleological information; and it can describe disjunctive actions. Together these 
abilities allow our plans to be used for plan execution in realistically complex domains. To motivate this, section 6 
places the ideas in the context of a spacecraft activity sequencing problem: planetary observation. This example 
causes us to reflect on the basic principle of least commitment problem solving in general, since it supports a form 
of least commitment reasoning which commits even less than current techniques. 

The primary result of this paper is a representation we call C-Plans . We claim that the representation is 
suitable for use in sequencing the activities of automated spacecraft. Further applications-oriented research is 
required to substantiate this claim. 

3. Current AI planners: least commitment plan construction. 

An AI planner is given the responsibility of constructing a plan of action. Such a planner is given an initial state 
description, a set of goals, and a set of action schemas. The schemas are parameterized plans, suitable for solving 
limited problems. A plan produced by the system is an artifact built " om individual operators, appropriately 
instantiated and ordered. This plan must be sanctioned by the system as a feasible means of achieving the given 
goals. In this section we examine briefly two of the main operations required to produce this plan: action ordering 
and variable instantiation. 
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3.1. Actios ordering. 


Early planners built locally ordered structures: a plan was a sequence of actions. This not only applied to the final 
plans produced by the system, but also to the partial plans that were built during search. With Sacerdotfs ndaii (3) 
system this aQ changed. NOAH built plans as partial orders on actions. This me a n t that it was possible for any two 
given actions to be ^ordered with respect to each other. The intuition behind this idea is that a partial order on 
actions characterizes very many total orders. Today such plans are often called nonlinear. It might seem that a 
system which builds nonlinear plans would be exponentially more efficient than one which builds linear, or totally 
ordered plans. Unfortunately this has never been proven. Only the intui tio n exists that n o n l ine a r is better than 
linear; but this intuition is better chan nothing. See Chapman [4] and Drummond (5] for more on this. 

31 Object selection. 

There is another son of least commitment found in some planners which relates to the way that the objects referred 
to in plans are selected. Variable instantiation is the process of selecting constants to bind to variables. Each 
variable can be bound to a specific constant; or unbound, meaning chat no constant has yet been selected as 
appropriate. But it is possible to operate in a more sophisticated way: we can post constraints on the permissible 
constants for any given variable. In this way. we constrain the possible bindings for a variable, rather than scLt. 
one as correct outright Information can be gathered during the process of plan con st r u ction which leads to the 
deletion of particular constants from the set of possibilities. The hope is that eventually the set of possibilities will 
be narrowed to one alternative, or reduced to the empty set. indicating that the constraints posted on the variable are 
so stria that there is no satisfactory object This method of associating constants with variables is known as least 
commitment object selection. Its genesis was in Molgen [2]. 

4. A framework for doing this: O-PIan. 

O-Plan is a modem planning system which owes many of its ideas to NooLin [6]. NonLin derives from NOAH, and 
extends it in many ways. For our purposes the essential contribution of NonLin is its completion of the search 
space of partial plans: NonLin could find plans that noah could not. This is because NonLin had plan modification 
operations available to it which defined its search space of partial plans so as to include plans that noah would 
never consider. O-Plan inherits its definition of the search space from NonLin. In the next section we consider the 
mechanisms used by O-Plan to search the space of possibilities. We explain how it keeps track of alternatives, and 
how it searches through the space of partial plans. We then go on to consider how this mechanism can be extended 
through a more flexible representation for plans. 

4.1. Agenda-based partial plan search. 

The planning components in the O-Plan framework employ various techniques to lessen the amount of potential 
search in any particular application. The techniques include least commitment variable binding, constraint cut-off. 
temporal coherence and various heuristic functions. O-Plan searches through a space of partial plan sates, guided 
by these techniques, where each partial plan sate is derived from the application of a plan modification operator to 
some current partial plan. This is essentially a search space of plan modifications, or operations whose application 
results in a new (partial) plan state. 

An O-Plan Plan State is a structure of some detail and it includes the partial order of activities that is 
currently being built (essentially the plan so far), a log of effects and conditions asserted or required in the plan, the 
teleological information used during plan generation (and available thereafter), variables used during planning and, 
finally, information on outstanding tasks generated during the planning process. These pending tasks are collected 
together into agenda lists from which each task can subsequently be scheduled in some opportunistic fashion. This 
mechanism provides for a dynamic approach similar to that provided by “blackboard** based systems. 

In practice there are two main agenda list Opes, cne for task specif rations which are fully instantiated, and 
one for task specifications where certain information has yet to be determined- A third “alternatives** agenda list is 
currently employed which should eventually disappear, but which has been used in the absence of a complete 
method for dependency- based plan repair. 


Task sele ct ion is done under the control of a scheduler, which provides the opportunism for the overall 
process. This scheduler can be regarded as a "plug-in” module in the O-Plan system and therefore it can reflect 
various scheduling strategies. The scheduling of a task from the agendas causes a handler (or knowledge source) to 
process dot particular task. The relevant handler is invoked by the type of the task scheduled hence the system is 
data driven by the tasks themselves. As well as changes to the current Plan State, the processing of a task generally 
results in the creation of new tasks or the amendment of existing tasks on the agendas. 

When choices are made, the task handlers have the option to either post dependency information in the Plan 
State, or to simply spawn alternative Plan States via the alternatives agenda mentioned earlier. The former method 
has the advantage that it offers the potential for proper plan repair where only the affected parts of the current 
(partial) plan are stripped off after a failure, while useful parts are saved and the work dooe in producing them 
protected. We are researching how this can be done by using partial plans augmented with teleology information, 
although there are many outstanding problems. 

Processing proceeds in cycles and finishes when all tasks have been processed or when there is reason for a 
particular task to terminate planning. In theory the handlers are independent of one another but they do have the 
ability to "poison** the current Plan State if they detect inconsistency or constraint violations. This is the time to 
backtrack, plan repair or simply give up. Search through the space of partial plans is therefore controlled by the 
scheduler which chooses the next best thing to do, using information provided by the tasks generated during 
planning. More detail of the OPlan control structure can be found in [1J. 

4.2. The requirement for truly disjunctive structures. 

O-Plan searches through a space of partial plans. When there's a choice that cannot be delayed, the current O-Plan 
task scheduler pursues one of the available options by incorporating it into the current Plan State. On failure, O- 
Plan may reconsider all previous Plan States on the alternatives agenda and pursue a previously ignored plan 
modification operation. In this way it follows a “one-then-best” search strategy as in NonUn. 

An alternative approach is demonstrated by the following scenario. Consider that at some point during its 
search for an acceptable plan, the system identifies an outstanding goal G . Assume that there are two action 
schemas which after analysis appear suitable for achieving G. The traditional approach says that this choice 
induces a bifurcation in the search space, each path considering one of the two possible actions. However if our 
developing plan is able to represent disjunction, such a bifurcation is unnecessary. Both possible actions (resulting 
from instantiating the schemas’ variables) can be installed in the plan. The only requirement is that the plan record 
the fact that these two actions stand in a disjunctive relationship. 

By the above discussion we aren’t suggesting that a planner consider all possible options at each point in its 
search; such behavior is doomed to failure, since the number of options open will inevitably be huge. Much of the 
information needed for later planning also becomes uncertain in a plan with too much disjunction. However if the 
plan representation is able to describe disjunction, then the system will have the option of including action 
disjunction as appropriate. 

Contingent plans are also necessary for doing realistic plan execution monitoring. When a plan is generated, 
it's unlikely that the generation component can guarantee what the world will be like when plan execution begins. 
To properly handle this we need disjunctive plans. The planner can produce plans which contain actions to deal 
with whatever contingencies it deems worth considering. Such a continge n ' plan must specify the conditions under 
which each of the planned actions is appropriate, to allow the execution component to correctly select which action 
to execute. 

So: we would like to formalize a plan structure able to represent disjunction of action. But in doing this 
there's a trap to avoid. We could easily over-simplify the data structures used by a system such as O-Plan. It 
would appear possible to formalize a nonlinear plan as a partially ordered set Mathematically all one requires is a 
set of actions and an ordering relation over that seL (See [4] for an example.) The ordering relation is required to 
be irreflexive and transitive, therefore asymmetric. The problem with such a simple formalization is that is fails to 
capture much of the information that 0-P!an exploits during plan generation. In particular, it does not capture the 
goal structure of a plan (7); that is, the causal structure that exists among the planned actions. 

There are other requirements on the formalization that we won't consider in this paper. In particular, we 
won’t address formalizing least commitment object selection. Data structures to support such operations are simple 
to formalize, but for ease of exposition, we won’t do it here. It is straightforward to add this to the formalism we 
present. 
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5. Formalizing contingent plans. 

We can borrow some notions and notation from Net Theory [8]. Not all the constructions that we need are part of 
net theory, so we’ll have to add a few bits onto the basic framework. We won’t motivate our additions; far a brief 
discussion, see [5], and for more extensive motivation 19]. Essentially, we use Condition/Event systems augmented 
with event occurrence preference orderings; we also identify the conditions and events of the system with predicates 
of a simple language. In this section, weTl proceed by informally defining the constructs of our plan language, 
building up the overall structure we tequire. The eventual goal is to define C -plans, or Contingent Plans, following 
on the arguments above. It is possible to be quite formal in defining these C-plans, but this paper simply explains 
and motivates them. 

5.1. Basic C-plan structure. 

A proposition is a functor applied to arguments. A functor is written in lower case, followed by its arguments in 
parentheses. Arguments are variables or constants; we allow infinitely many of each. Variables are written in upper 
case, constants are written in lower case. For example, both onfaXh and skew-platform( 15, left) are propositions. 

Propositions are identified with what we call b-elements and e-elements. A b-element is intended to denote a 
condition in the world, and can be true or false. For instance, the b-eJement clearfc) under a blocks world 
interpretation is true if and only if the block denoted by c has nothing on its upper surface. Propositions are also 
identified with e-elements. An e-element is intended to denote an action, the occurrence of which changes the 
holding of certain conditions. 1 For instance the e-element move(aJ>.c) in a blocks world context might denote the 
action of moving the block denoted by a from the block denoted by b to the block denoted by c. Certain conditions 
must hold if this action is to occur; furthermore, when the action does occur, certain conditions in the world wifi no 
longer hold, and certain others which did net hold will begin to do so. For example, in the case of the block 
movement we might expect that a can only be moved from b to c if a is initially on b . Following the movement, a 
will be on c. We need to capture these condition-action relationships in our plan representation. 

To do this we introduce the notion of a flow relation, A flow relation is a set of ordered pairs, each pair in 
the set ordering either a b-element and e-element, or e-element and b-element. The ordering of a b-element and e- 
element is interpreted as an enable relation. Thus, the holding of certain conditions is understood to enable the 
occurrence of certain actions. The ordering of an e-element and b-element is interpreted as a cause relation: actions 
can cause the holding of certain conditions. The flow relation describes the relationship between any given event 
and that event’s enabling conditions and effects. It captures what O-Plan and NonLin call Goal Structure; the best 
dictionary word for this concept is probably teleology . We use the word to refer to the reasons for some event or 
condition being included in a plan. The flow relation of a net allows a formal analysis of which actions can be used 
to enable which other actions; this is essentially the reasoning that O-Plan performs to generate a plan. Other 
modem planners, such as sipe [10] also include such infoimation in their plan data structures. 

We will refer to the b-elements which are ordered immediately before an e-element as that e-elcment’s 
preconditions ; similarly, we will refer to the b-elements ordered immediately after it as its postconditions. 

Graphically we present b-elements as circles and e-elements as squares. Each circle is labeled with the 
proposition which is the b-element, and each square is labeled with the proposition which is the e-element The 
flow relation is drawn as arcs from circles to squares and from squares to circles. If an arrow is to go from a circle 
to a square, and another from the same square to the same circle, we draw only one line, and use an arrow-head on 
each end of the line to indicate the two arcs. 

One other ordering relation is needed to complete the basic C-plan structure. This is the before relation, used 
to constrain the way that a net can execute. Intuitively, the before order is a specification of which events mu?* 
occur before which other events if a plan is to run to its intended completion. We often refer to the before rclaticv: 
as execution advice . Consider the cause and enable orderings in a C-plan’s flow relation describe what is causally 
possible. But in planning we are often interested in only one of generally many causally permitted execution 
sequences. Causal orderings will not always uniquely constrain a set of actions to describe just those behaviors 

1 We use ihe terms * ‘action’ * and “event” interchangeably, as convenient. 
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which achieve a planner’s overall goals. 

A classic example of this occurs in blocks-world tower construction problems. For example: given the 
problem of creating a tower with block C on the bottom, block B in the middle, and block A on top, the plan 
construction reasoning must order the two required stack actions to reflect its overall goals. To see this, assume 
that all blocks are initially clear and on the table. If a plan calls for stacking A on B, and B on C, then both stack 
actions are enabled in the initial state. It is not an ordering enforced by causation that requires the stacking of B on 
C before A on B. Rather, it is the agent’s intention regarding overall plan execution outcome that directs the 
sequencing of the two actions. 

So a C-plan is defined by specifying a set of b-elements (which denote the conditions of interest in the 
domain being modelled), a set of e-elements (which denote the relevant actions), and an ordering relation on the 
members of these two sea (technically, the relation is bipartite, since it orders members of two different sets). The 
C-plan is augmented by giving some execution advice for causally underconstrained actions. This advice takes the 
form of an ordering relation on C-plan e-elements. To keep the graphical presentation of C-pians simple, we do not 
draw arcs between e-elements ordered in the execution advice. Instead, the ordered pairs are simply listed beside 
the net 

A simple blocks world plan basically compatible with what we have defined here can be found in [11]. 

52. C-plan projection. 

We now have to say something about the projection of a C-plan. A projection is a structure which supports 
reasoning about the behaviors that a C-plan describes. First we must say something about the conditions under 
which events can occur and what changes they realise by occurring. Second we must build up the projection 
structure which describes the overall behavior of a C-plan, using the definition of individual event occurrence as a 
building block. E-element occurrence can be used as a “state generates* to create a state-space account of the 
behaviors permitted by a plan. 

We will call an arbitrary set of b-element propositions a case . We interpret such a set of propositions as a 
partial description of a state of the world. If a proposition is in a case, then it is true; if it is not in the case, then it 
is false. Graphically, we present cases only in terms of C-pians - when doing so, we place a dot (a token) inside 
each and only those circles labelled with propositions in the plan which are also in the given case. 

We can use this idea of a case as a partial world state description to define when an individual e-element is 
enabled; that is, when the action it denotes is allowed to occur. To model this, we can say that an e-element is 
enabled in a case if and only if its preconditions are a subset of the case, i.e., if the enabling conditions of the event 
are true. We also require that none of the e-element’s postconditions are already in the case, unless they are also 
preconditions. Further, we can specify how the world is changed under the occurrence of the action, by defining 
how an e-element’s enabling case is modified to gain a successor. We can generate a new case through the 
occurrence of an e-element the new case is defined to be the old one, minus all the e-eiement’$ preconditions, plus 
all the e-element’s postconditions. The effects of an event are made true in the successor case, and the enabling 
conditions are made false. If a precondition is not made false by the occurrence of an action, one need only make 
the relevant b-element a postcondition of the e-element as well. 

This definition of e-element occurrence can be used to build up a state-space graph structure which tells a 
story about the possible behaviors of a C-plan. Given an initial case and a C-plan, we can build up a projection 
graph as follows. The initial case is used as the starting node of the projection graph. E-elements of the given C- 
plan are repeatedly applied in non-terminal projection graph cases until there are no more cases in which any of the 
C-plan *s e-elements have concession. Arcs leading from node to node in this graph are labelled with e-elements. 
An arc directed from one node a to another node {3 indicates that the e-elcinent labelling the arc has concession in 
the case contained in a and under occurrence, produces the case contained in (3. 

The idea is that the graph structure defined in this way contains a given initial case as its starting node, and 
that each node in the graph contains a case reachable under e-element occurrence. With the interpretation of a case 
as a partial description of the world, the projection gives us a prediction of what a C-plan can do in terms of the 
possible world states it might give rise to. The initial case describes the “current” state of the world, and cases in 
the graph reachable from the initial case describe future possible world states. The arcs in the graph denote 
transitions from one world state to another, and these transitions can be realized through the actual execution of the 
actions that correspond to the e-ele merits labelling the arcs. 
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So the nodes of our projection graph contain cases, and the arcs are labelled with steps. We can map this 
structure onto the classical AI picture of planning as follows. The first node of our projection is the initial state 
given in the problem specification. In order to represent a solution to the problem, the plan's projection must give 
rise to a node which contains the required goals. We can say that a C-pIan is a potential solution to a planning 
problem if it is applicable in the initial case of the problem, and under projection gives rise to a case which 
contains the given goals. Also, a particular case reachable under e-element occurrence in a partially developed C- 
Plan can be used for "Question Answering" operations in the planner during plan generation. 

Using the idea of projection we can now say something more precise about a C-plan’s execution advice. 
Recall the basic idea. Execution advice must contain the information required to remove harmful residual non- 
determinism. The advice should not restrict legitimately causally independent actions from occurring concurrently, 
but it should prevent planned actions from occurring in an order permitted by the causal structure of the plan but 
unintended by the planner. We can explain the meaning of a plan's execution advice by interpreting it as a guide 
to navigation through the projection structure. Basically, we say that a C-plan*s execution advice is sound (with 
respect to a given problem specification) if and only if for all choice points in the projection, if there is any hope 
for success at the choice point, then either all choices lead to success, or for each choice point that could lead to 
failure, there is advice about another possible alternative, such that the suggested alternative can lead to success. In 
essence, when there is still hope for success the advice prevents the wrong sequencing choice from being made. To 
achieve this the advice must prescribe an order on e-eiements which prevents certain paths through the projection 
from being considered at execution time. 

It is possible to generalize the projection we have defined to deal with least commitment reasoning about 
action ordering. To do this, one need only say when a set of e-elements are causally independent, and use this 
definition to specify when sets of e-elements can be applied to a case, in bulk, to derive a successor. If this is done 
the arcs of the projection graph are labelled with sets of e-elements which describe the parallel occurrence of the 
denoted actions. This means that if some events are causally independent the e-elements which describe them can 
be applied as a set, and reasoning can continue from the resulting case. 

6. A spacecraft activity sequencing example. 

This section presents an example problem and its representation using C*plans. This problem would be difficult if 
not impossible to represent using the classic partially ordered structures found in systems like NOAH and NonLin. 

The basic scenario for the example is as follows. While on a deep space mission, a spacecraft is to pass very 
close to the planet Jinx. Earth-based observation has determined that two weather systems obtain on Jinx: crystal 
clear skies and turbulent sand storms. While it isn't known exactly what conditions will hold when the spacecraft 
arrives, it is certain to be one of these two. So useful observations can be made regardless of the atmospheric 
conditions. If the atmosphere is unclouded, then visible light pictures should be taken. If a sand storm is in 
progress, then infrared pictures will be most effective. 

The camera used for visible light and infrared pictures is the same, it is impossible to take a visible light 
and infrared picture in parallel. An initialization step is required in order to prepare the camera for visible light or 
infrared work. Regardless of the sort of picture taken, a digital image is stored in a frame buffer on board. The 
frame buffer is only large enough to store one picture. Each time a picture is written to the frame buffer by the 
camera, a transfer operation must free the buffer by copying the information to an on-board tape storage medium. 
For this simple example, we do not address the problem of transferring the stored images back to Earth. 

It would be nice to avoid specifying an observation programme rigidly in advance. Since Jinx is too far from 
Earth to permit the up-loading of an appropriate command sequence (using information gathered closer to the 
encounter) it is preferable to be opportunistic, and exploit the atmospheric conditions which obtain when the 
spacecraft arrives. During the period of contact, conditions may change, and (he pictures being taJcen should reflect 
current opportunity. 

From an AI planning perspective, the problem is to have a plan which represents the disjunctive observational 
requirement simply and economically. Notice that it is not a problem to have an on-board computer which runs a 
contingent program during the Jinx encounter phase. In principle, the program could be written in any language 
whatever, compiled, and up-loaded to the spacecraft well in advance. But for an AI planner the problem is one of 
representing the disjunction in a way that permits reasoning about a plan, since the plan will form part of a lager 
scenario with unexpected events and changing requirements. We give a C-plan which does this. It specifies what 



each of the individual observation operations are and the conditions under which they are to be canied out 

The plan of figure 1 is projected in figure Z The projection describes the behaviors that are possible for the 
plan. Each arc in the projection is labelled with an integer as used for each event in figure 1. Notice that for this 
example no execution advice is required. See [II] for an example of how this ordering relation is used. 

The plan describes the following behaviors. While it doesn't matter what conditions obtain when the 
spacecraft arrives at Jinx, assume for the sake of argument that: the spacecraft camera is initialized for infrared 
work; that the weather on Jinx is clear, and that the frame buffer is empty. A case which describes these conditions 
is contained in the projection node 51. Two events are possible, as described by the C-plan’s e-elements clouding 
(2) and sefupfvis) (4). Clouding (2) denotes the event of the atmosphere becoming clouded by a storm. The 
setup(vis) (4) e-element denotes the action of configuring the camera to take visible light pictures. Similarly, the e- 
element setupfir) (3) denotes the action of configuring the camera lo take infrared pictures. 

There are two tight cycles in the projection, one between 52 and 53, and one between 54 and 55. These 
cycles model the normal behavior of the plan during a period when the atmosphere is in a stable state. A transition 
from 52 to 53 models the action of taking a visible light picture, and a transition from 53 to 52 models the action 
of transferring the picture information from the frame buffer to tape. Likewise, a transition from 55 to 54 models 
the action of taking an infrared picture, and a transition from 54 to 55 models clearing the frame buffer to tape. All 
other transitions in the projection can be easily read as setup actions in response to changes in the planet's 
atmosphere. 

7. Conclusions. 

There is a relationship between the choices of action schema to achieve goals at plan generation time, and 
contingent plans which support flexible plan execution. Plan generation is reasoning about goals and the means to 
achieve them. Plan execution is about actually realizing these promised goals. If fast, efficient, and flexible A! 
planning systems are to ever exist, they must strike a balance between reasoning about disjunction in advance, and 



Figure 1: A contingent plan for taking visible light or infrared pictures. 
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Figure 2: The projection of the pictuit-takkg plan. 


reasoning about it only when necessitated by plan execution failures. This paper goes some way towards the 
construction of such a planner by defining a flexible and expressive plan representation which has the ability to 
represent disjunctive plans. It does this without losing information such as Goal Structure, used by systems like 
Nonltin [6], O-Plan [1] andstPE [10]. 

It is important to realize that what we have defined is a representation able to describe contingent actions 
which is useful from an A2 perspective. It is not hard to write contingent computer programs. But the eventual 
goal is to automate spacecraft command generation. It is likely that AI techniques will be used to perform this task. 
A start at this has been made with the Deviser planner for the Voyager spacecraft [12]. What this means is that AI 
representations must be used, and where inadequate, must be improved. Since disjunctive situations will often arise, 
any planner automatically generating spacecraft commands must be able to reason about disjunction. 

We are now working on adapting O-Plan to generate C-pIans. Simple disjunctive plans can already be 
generated; more interesting examples will require more complex generation algorithms. We are currently working 
on an algorithm to achieve a specified marking in a Petri Net to help produce a robust and efficient C-pIan 
generation algorithm. 
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>moua mobile robot reason and plan complex tasks in real-world environments/ To cope with tbs 
use a highly reactive system to which is attributed attitudes of belief, desire, and intestioo. Because these 
attitudes axe explicitly represented, they can be manipulated and reasoned about, resulting in complex goal-directed and reflective behaviors. Unlike most 
planning systems, the plans or intentions formed by the system need only be partly elaborated before it decides to act. This allows the system to avoid 
overly strong expectations about the environment, overly constrained plans of action, and other forma of over-commitment common to previous planners In 
addition, the system is continuously reactive and has the ability to change its goals and intentions as situations warrant. Thus, while the^ystem architecture 
allows for reasoning shout means and ends in much the same way u traditional planners, it also poetesses the reactivity required for survival in complex 
real- world domains. yJ P V *' 

Ws hsvr t sstsH jhr systematising SRT> autonomous robot (Flakey) in a scenario involving navigation and the performance of an emergency task in a space 
station scenario.^ 


1 Introduction \ 

One feature that is critical to the survival of all living creatures ia their ability to act appropriately in dynamic environments. For lower life forma, it 
seems that sufficient capability is provided by sti mu Pus- response and feedback mechanisms. Higher life forms, however, require more complex abilities, 
such as reasoning about the future and forming plans V action to achieve their goals. The design of reasoning and planning systems that are s'faafed in 
environments with real time constraints can thus be seeds.*! fundamental to the development of intelligent autonomous machines. 

In this paper, we describe a system for reasoning about amd performing complex tasks in dynamic environments and demonstrate how the system can be 
applied to the control of an autonomous mobile robot. The system, called a procedural reasoning system (PRS), is endowed with the psychological attitudes 
of belief, desire, and intention. At any instant, the actions tftat the system considers performing depend not only on the current desires or goals of the 
system, but also on it* beliefs and previously formed intent ions ^The system also has the ability to reason about its own internal state - that is, to reflect 
on its own beliefs, desires, and intensions and to modify these as sX chooses. This architecture allows the system to reason about means and ends in much 
the ssme way as traditional planners, but provides the reactivity required for survival in complex real-world domains. 

As the task domain, we envisage a robot in a space station acting u wu astronaut’s assistant. When asked to get a wrench, for example, the robot works 
out where the wrench ia kept, plana a route to get it, and goes there. \f the wrench is not where expected, the robot may reason further about bow to 
obtain information on its whereabouts. It then finally returns to the astronaut with the wrench or explains why it could not be retrieved. 

In another scenario, the robot may be midway through the task of retrievmg\be wrench when it notices a malfunction light for one of the jets in the reactant 
control system of the space station. It reasons that handling this malfunction of higher priority than retrieving the wrench and sets about diagnosing the 
fault and correcting it. Having done this, it continues with its original task, finally telling the astronaut what has happened. 

To accomplish these tasks, the robot must not only be able to create and execute plana, but must be willing to interrupt or abandon a plan when 
circumstances demand it. Moreover, because the world in which the robot ia situated ia continuously changing and because other agents and processes can 
issue demands at arbitrary times, performance of these tasks requires an architecture that ia highly reactive as welt as goal-directed. 

We have used PRS with the new SRI robot. Flakey, to accomplish much of the two scenyios described above, including both the navigation and malfunction- 
handling tasks. In the next section, we discus# some of the problems with traditionaPpUnning systems. The architecture and operation of PRS is then 
described, and Flakey ’s primitive capabilities are delineated. We then give a more detailed analysis of the problems posed by this application and our 
progress to date. We concentrate on the navigation taak; the knowledge base used for the jet malfunction handling is described elsewhere [15,17]. 


2 Previous approaches 


Most architectures for intelligent autonomous systems consist of a plan constructor and a plan executor. Typically, the plan constructor plans an entire 
course of action before commencing execution of the plan [1,1 1,25.28,30.32.33,3*4). The plan itself is usually composed of primitive actions - that is, actions 
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that an directly performable by the system. The motivation for thie approach, of couree, ia to eneun that the planned sequence of acuone actually achieves 
the prescribed goal. As the plan is executed, the system performs the primitive actions in the plan by calling various low-level routines. Usually, execution 
is monitored to ensure that these routines achieve the desired effects; if they do not, the system may return control to the plan constructor to modify the 
existing plan appropriately. There are, however, a number of serious drawbacks with this architecture as the basis for the design of autonomous agents. 

First, this kind of planning is very time consuming, requiring exponential search through potentially enormous problem spaces. It is thus usual for classical 
AI planners to spend considerable time thinking before performing any effector actions. While this may be acceptable in some situations, it is not suited to 
domains where replanning is frequently necessary and where system viability depends on readiness to act. In real-world domains, unanticipated events are 
the norm rather than the exception, necessitating frequent replanning. Furthermore, the real-time constraints of the domain often require almost immediate 
reaction to changed circumstances, allowing insufficient time for this kind of planning. 

Second, in real-world domains, much of the information about bow best to achieve a given goal ia acquired during plan execution. Fo/exgmple, in planning 
to get from home to the airport, the particular sequence of actions performed depends on information acquired on the way - such as which turnoff to take, 
which lane to get into, when to slow down and speed up, and so on. IVaditional planners can only cope with this uncertainty in two ways: (1) by building 
highly conditional plans, most of whose branches wilt never be used, or (2) by leaving low-level tasks to be accomplished by fixed primitive Operators 
that are themselves highly conditional (e.g., the intermediate level actions (ILAs) used by SHAKEY [23]). The former esse is combinatorial^ explosive or 
simply cannot be done - the world around us is simply too dynamic to anticipate all circumstances. The latter, as usually implemented, seriously restricts 
flexibility and reasoning capabilities. Of course, in situations where we can paint ourselves into a corner, some preplanning is ne ces s ar y. But even this need 
not involve expanding plans down to the level of primitive operators; indeed, we may do the planning in quite a different abstraction space than that used 
to guide our actions in the real world (see, for example, the representations in the missionaries and cannibals problem discussed by Amard [3]). 

A third drawback of traditional planning systems is that they usually provide no mechanisms for reacting to new situations or goals during plan execution, 
let alone during plan formation. For example, many robots (e.g., SHAKEY [23]) effectively shut off their abilities to react to new situations and goals 
while moving from one location to another. Only low-level feedback mechanisms and emergency sensors such as collision detectors remain enabled. Such 
disregard for sensory input is particularly undesirable in realistic environments in which unpredictable events may occur or other agents may be active - 
because of innaccurate information about the actual state of tbe world, actions may be chosen that are inappropriate to achieving the goals of the system. 
By remaining continuously aware of the environment, an agent can modify its actions and goals as the situation warrants. 

Indeed, the very survival of an autonomous system may depend on its ability to react quickly to new situations and to modify iu goals and intentions 
accordingly. For example, in the scenario described above, the robot must be capable of deferring the task of fetching a wrench when it notices something 
more critical that needs attention (such as a jet failure). The robot thus needs to be able to reason about its current intentions, changing and modifying 
these in the light of its possibly changing beliefs and goals. White many existing planners have replanning capabilities, none have accomodated modifications 
to the system’s underlying set of goal priorities. 

Finally, current planners are overcommitted to the planning strategy itself - no matter what the situation, or how urgent the need for action, these systems 
atwags spend as much time as necessary to plan and reason about achieving a given goal before performing any external actions whatsoever. They do 
not have the ability to decide when to stop planning, nor to reason about the trade-offs between further planning and longer available execution time. 
Furthermore, they are committed to one particular planning strategy, and cannot opt for different methods in different situations. This clearly mitigates 
against survival in the real world. 

In sum, the central problem with traditional planning systems may be viewed as one of overcommitment. These systems have strong expectations about 
the behavior of the environment and make strong assumptions about the future success of their own actions. They are strongly committed to their goals 
and intentions and, except in certain simple ways, cannot modify them as circumstances demand. This would be fine if it were possible to build plans 
that accommodate all the complexities to which an agent must be responsive; unfortunately, in most real-world domains, the construction of such plans is 
infeasible. 

Of course, we are not suggesting that preplanning, followed by later replanning, can be completely avoided: because of unanticipated changes in the 
environment, an agent will often have to reconsider its goals or its intended means of achieving these. This is a property of the environment that an agent 
can do little about. If the agent did not make some assumptions about the behavior of the environment, there is little chance it would ever be able to act. 
On the other hand an agent should not make too many assumptions about the environment - to tbe extent possible, decisions should be deferred until they 
have to be made. The reason for deferring decisions is that an agent can only acquire more information as time passes; thus, the quality of its decisions can 
only be expected to improve. Of course, there are limitations resulting from the need to coordinate activities in advance and the difficulty of manipulating 
excessive amounts of information, but some degree of deferred decision-making is clearly desirable. 

There has been some work on developing planning systems that interleave plan formation and execution [10,21,29]. While these systems can cope far 
better with uncertain worlds than traditional planners, they are still strongly committed to achieving the goals that were initially set them. They have no 
mechanisms for changing focus, adopting different goals, or reacting to sudden and unexpected changes in their environment. The reactive systems used 
in robotics also handle changes in situation better than traditional planning systems [2,7,18]. Even SHAKEY [23] utilised reactive procedures (ILAs) to 
realise the primitive actions of the high-level planner (STRIPS), and this idea is pursued further in some recent work by Nilsson [24). However, there is no 
indication of how these systems could reason rationally about their future behaviors, such as to weigh the pros and cons of taking one course of action over 
another. 


3 Knowledge Representation 

The system we used for controlling and carrying out the high-level reasoning of the robot is called a Procedural Reasoning System (PRS) [15]. * The system 
consists of a data hose containing current heltefs or facts about the world, a set of current goals or desires to be realized, a set of procedsres (which, for 
historical reasons, are called knowledge areas or KAs) describing how certain sequences of actions and tests inay be performed to achieve given goals or to 
react to particular situations, and an interpreter (or inference mechanism) for manipulating these components. At any moment, the system will also have 
& process stack (containing all currently active KAs) which can be viewed as the system's current intentions for achieving its goals or reacting to some 
observed situation. 

The basic structure of PRS is shown in Figure 1. A brief description of each component and its usage is given below. 3 Later sections will give examples of 
PRS use in the the robot scenario. 

1 Flakey m being used in • variety of ripen menu at SRI, and PRS is just one of various lyiltns being employed for controlling Flakey. 

7 A more formal description of PRS may be found in [17]. 
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Figure 1: System Structure 


3.1 The System Data Base 

The content* of the PRS data hue may be viewed a* representing the current beliefs of the system. Some of these beliefs may be provided initially by the 
system user. Typically, these will include facts about static properties of the application domain — for example, the structure of some subsystem, or the 
physical laws that some mechanical components must obey. Other beliefs are derived by PRS itself as it executes its KAs. These will typically be current 
observations about the world or conclusions derived by the system from these observations. Consequently, at some times the system may believe that it is 
in a particular hallway, and at other times, in another. Updates to tbs data baas therefore necessitate the use of consistency maintenance techniques. 

The data base itself consist* of a set of state description* describing what is [believed to be] true at the current instant of time. We use first-order predicate 
calculus for the state description language. Free variables, represented by symbols prefixed with I, are assumed to be universally quantified. The statement 

(V <-» (on lx table)) (red (color lx))) 

for example, represents states of the world.in which every object on the table is red. Data base queries are done using unification over the set of data base 
predicates. 

State descriptions that describe interns/ system states are called metalevel expressions. The basic metalevel predicates and functions are predefined by the 
system. For example, the metalevel expression (goal g) is true if g is a current goal of the system. 

3.2 Goals 

Goals appear both on the system goal stack and in the representation of KAs. Unlike most AI planning systems, PRS goals represent desired behaviors 
of the system, rather than static world states that are to be (eventually) achieved. Hence goals are expressed as conditions on some interval of time (i.e., 
some sequence of world states). 

Goal behaviors may be described in two ways. One is to apply a temporal predicate to an n-tuple of terms. Each temporal predicate denotes an action t*pc 
or a set of state sequences. That is, an expression like -(walk a b)" can be considered to denote tbe set of state sequences which embody walking actions 
from point a to b. 

A behavior description can also be formed by applying a temporal operator to a state description. Three temporal operators are currently used. The 
expression ( fp), where p is some state description (possibly involving logical connsctives), is true of a sequence of states if p is true of the last state in 
the sequence; that is, it denotes those behaviors that achieve p. Thus we might use the behavior description (t (walked a b)) rather than (walk a b). 
Similarly, (?p) is true if p is true of the first state in the sequence - that is, it can b« considered to denote those behaviors that result from a successful test 
for p. Finally, (#p) is true if p is preserved (maintained invariant) throughout the sequence. 

Behavior descriptions can be combined using the logical operators A and V. Thess denote, respectively, the intersection and union of the composite 
behaviors. 

As with state descriptions, behavior descriptions are not restricted to describing the external environment, but can also be used to describe the internal 
behavior of the system. Such behavior specifications are called metalevel behavior specifications. One important metalevel behavior is described by an 
expression of the form (*> p). This specifies a behavior that places tbs state description p in the system data base. Another way of describing this behavior 
might be (!(he/te/ p)). 

3.3 Knowledge Areas 

Knowledge about how to accomplish given goals or react to certain situations is represented in PRS by declarative procedure specifications called knowledge 
areas (KAs). Each KA consists of a bodg f describing the steps of the procedure, and an invocation condition that specifies under what situations the KA is 
useful. 
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Tba body of a K A is represented as a graphical network and can be viewed as a plan or plan schema. However, it differs in a very important way from 
the plans produced by most AI planners: it does not consist of possible sequences of primitive actions, but, rather, of possible sequences of nk feelr to be 
achieved. Thus, the bodies of KAs are much more like the high-level "operators” used in planning systems such as NOAH [28] and SIPE [34). They differ in 
that (1) the subgoals appearing in the body can be described by complex temporal expressions (i.e., the goal expressions described in the preceding section), 
and (2) the allowed control constructs are much richer, and include conditionals, loops, and recursion. One important advantage of using abstract subgoals 
rather than fixed calls to actions is that the knowledge expressed in any given KA is largely independent of other KAs, thereby proving a very high 
degree of modularity. It is thus possible to build domain knowledge incrementally, with each compooent KA having a well-defined and eerily understood 
semantics. 

The invocation part of a K A contains an arbitrarily complex logical expression describing under what conditions the KA is useful. Usually, this consists of 
some conditions on current system goals (in which case, the KA is invoked in a goal-directed fashion) or current system beliefs (resulting in data-directed or 
reactive invocation), and may involve both. Together, the invocation condition and body of a KA express a declarative fact about the effects of performing 
certain sequences of actions under certain conditions. 

The set of KAs in a PRS application system consists not only of procedural knowledge about a specific domain, but also includes metalevei KAs — that is, 
information about the manipulation of the beliefs, desires, and intentions of PRS itself. For example, a typical metalevel KA would supply a method for 
choosing between multiple relevant KAs, or how to achieve a conjunction of goals, or how much further planning or reasoning can be undertaken given the 
real-time constraints of the problem domain. These metalevel KAs may, of course, utilise domain-specific knowledge as well. In addition to user-supplied 
KAs, each PRS application contains a set of system-defined default KAs. These are typically domain-independent metalevel KAs. 

3.4 The System Interpreter 

The system interpreter runs the entire »y«tem. From a conceptual viewpoint, it operates in a relatively simple way. At any particulsr time, certain goals 
are active in the system, and certain beliefs are held in the system data base. Given these extant goals and beliefs, a subset of KAs in the system will be 
relevant (applicable). One of these KAs will then be chooen for execution. 

In the course of traversing the body of the chosen KA, new subgoals will be posted and new beliefs will be derived. When new goats are pushed onto the 
goal stack, the interpreter checks to see if any new KAs are relevant, and executes them. Likewise, whenever a new belief is added to the data base, the 
interpreter will perform appropriate consistency-maintenance procedures and possibly activate new applicable KAs. During this process, vanous metalevel 
KAs may also be called to make choices between alternative paths of execution, to choose between multiple applicable KAs, to decompose composite goals 
into achievable components, and to make other decisions. 

This results in an interleaving of plan selection, formation, and execution. In essence, the system forms a partial overall plan (chooses a KA), figures out 
near term means (tries to find out how to achieve the first subgoal), executes them, further expands the near-term plan of action, executes further, and 
so on. At any time, the plans the system is intending to execute (i.e., the selected KAs) are both partial and hierarchical — that is, while certain general 
goals have been decided upon, specific questions about the means to achieve these ends are left open to future deliberation. 

This approach has many advantages. First, systems generally lack sufficient knowledge to expand a plan of action to the lowest levels of detail - at least if 
the plan is expected to operate effectively in a real-world situation. The world around us is simply too dynamic to anticipate all circumstances. By finding 
and executing relevant procedures only when needed and only when sufficient information is available to make wise decisions, the system stands a better 
chance of achieving its goals under real-time constraints. 

Because the system is repeatedly assessing its current set of goals, beliefs, and the applicability of KAs, the system also exhibits a very reactive form of 
behavior, rather than being merely goal-driven. By reactive, we mean more than a capability of modifying current plans in order to aceompfish given goals; 
a reactive system should also be able to completely changt its focui and pursue new goals when the situation warrants it. This is essential for domains in 
which emergencies can occur and is an integral component of human practical reasoning. 

Because PRS expands plans dynamically and incrementally and also allows for new reactive KAs to respond when they are relevant, there are frequent 
opportunities for it to react to new situations and to change goals. The system is therefore able to modify its intentions rapidly on the basis of what it 
currently perceives as well as upon what it already believes, intends, and desires. It can even change its intentions regarding its own reasoning processes - 
for example, the system may decide that, given the current situation, it has no time for further reasoning and must act immediately. 


3.5 Multiple Asynchronous PRSs 

In some applications, it is necessary to monitor and process many sources of information at the same time. PRS was therefore designed to allow several 
instantiations of the basic system to run in parallel. Each PRS instantiation has its own data base, goals, and KAs, and operates asynchronoosly with other 
PRS instantiations, communicating with them by sending messages. The messages are written into the data base of the receiving PRS, which must then 
decide what to do with the new information, if anything. 

Typically, this decision is made by a fact-invoked KA (in the receiving PRS), which responds upon receipt of the external message. On the basis of such 
factors as the reliability of the sender, the type of the message, and the beliefs, goals, and current intentions of the receiver, it is determined what to do 
about the message - for example, to acquire a new belief, establish a new goal, or modify intentions. 

We have found the ability to perform multiple activities concurrently to be crucial in the robot domain. Although some systems do generate plans, portions 
of which can be executed in parallel (e.g., NOAH [28] and SIPE [34]), our motivations for parallelism are quite different. In our case, ibe parallelism 
is essential for processing the constant stream of sensory information and for controlling devices continuously. That is, parallelism is required for the 
system’s proper operation. In NOAH and SIPE, however, the parallelism is simply fortuitous and does not result from any demands on processing speed 
or distributed functionality. 


4 Flakey the Robot 

Flakey was designed and built within SRl’s Artificial Intelligence Center, and is being used by several research teams to test software-organisation ideas. 
It contains two onboard computers, a SUN II workstation (with 12Mb disk) and a Z80 microprocessor. The Z80 is the low-level controller, receiving 
instructions from, and returning current information to, the SUN. The SUN, in turn, can be connected to an ethernet cable, allowing the robot to operate 
in either stand-alone or remote-control modes. The SUN can also be accessed from a small console on Flakey itself. 

The Z80 manages 12 sonars, 16 bumper contacts, and 2 stepper motors for left and right wheels. Voice output and video input are managed by the SUN. 
A robot arm will be added in the future. The application described here uses only the sonars, voice, and wheels. 
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Figure 2: Top-level Strategy 

The 12 sonars are located approximately 5 inches off the ground, 4 facing forward, 4 backward, and 2 on each side. To obtain a sonar reading, the SUN 
must issue a request to the Z80 and then wait until the result has been returned. While waiting, the SUN can continue with other processing. At present, 
the SUN can obtain no more than a few sonar readings per second. 

The motors for the left and right wheels can be controlled independently, again by having the SUN send a request to the Z80. For each wheel one can specify 
a desired distance, a maximum forward speed, and a desired acceleration. The Z80 uses the given acceleration to achieve the maximum speed compatible 
with the desired distance. 

Changing direction is done by requesting different speeds for the two wheels. When the robot is stationary, this can be reduced to a simple rotation; when 
the robot is moving, more complex algorithms are required. Direction changes are much more difficult when they must be negotiated during a forward 
acceleration. 

As welt as receiving the desired values of distance, speed, and acceleration from the SUN, the Z80 transmits current actual values to the SUN. Th.s is done 
using interrupts that occur at a rate of approximately fifty times per second. The ZSO also runs a position iutegrator, thus making available the robot's 
position and orientation relative to particular reference axes. In line with our wish to avoid reliance on dead reckoning, however, we did not use the position 
integrator for the top-level navigation task; it was used, however, for such low-level tasks as estimating the robot’s alignment within a hallway. 

There is significant noise in every measurement available to the SUN. The sonars, while generally accurate to about 5 millimeters, can occasionally return 
invalid readings and can also fail to see an object if the angle of incidence is high enough. Furthermore, Fl&key’s sonars sense the closest object within a 
30-degree cone, so that open doorways are not seen until the sonars are well past the doorpost. Similarly, Flakey will stop within about 5 millimeters of 
the requested distance and will travel at speeds which fluctuate up to 10 millimeters/second above and below the requested maximum speed. 


5 The Domain Knowledge 

The scenario described in the introduction includes problems of route planning, navigation to keep on route, and various general tasks such as 
handling and requests for information. In this paper, we will concentrate on the route planning and navigation tasks. However, it is important to realise 
that the knowledge representation provided by PUS is used for reasoning about all tasks that the system performs. 

The way the robot (that is, Flaxey under the control of PRS) solves the tasks of the space station scenario is roughly as follows. To reach a particular 
destination, it knows that it must first plan a route and then navigate that route to the desired location (see the KA depicted in Figure 2). In planning 
the route, the robot uses knowledge of the topology of the station to work out a route to the target location, as is typically done in navigation tasks for 
autonomous robots [6,7,22]. The topological knowledge is of a very high-level form, stating which rooms are in which corridors and how corridors are 
connected. A plan formed by the robot is also high-level, typically having the following kind of form; “Travel to the end of the corridor, turn right, then go 
to the third room on the left.” The robot's knowledge of the topology of the problem domain is stored in its data base, and its knowledge of bow to plan a 
route is represented in various route-planning KAs (see Figures 4. 5, and 6). This is quite different from the approach adopted by traditional AI planners, 
whieh would find a route by symbolically executing the actual operators specifying possible movements through rooms and down hallways. 

A different set of KAs is used for navigating the route mapped out by the route-planning KAs (see Figures 7, 8, and 9). The navigation KAs perform such 
tasks as sensing the environment, determining when to turn, adjusting bearings where necessary, and counting doors and other openings. 

Yet other KAs perform the various other tasks required of the robot. Many of these are described by us elsewhere [17]. Metalevel KAs choose between 
different means to realize any given goal, and determine the priority of tasks when mutually inconsistent goals (such as diagnosing a jet failure and 
fetching a wrench) arise. If the robot’s route plan fails, .the route-planning KAs can again take over and replan a route to the target destination. In the 
implementation described herein, however, we have not provided any KAs for reestablishing location once the robot has left its room of departure, and so 
it does not currently exhibit any replanning capability. 
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5.1 The Planning Space 

Am slated above, the robot’s route planning is done in a very abstract space containing only topological information about how the rooms sad hallways 
connect. It is, in fact, the kind of map found in street or building directories, stripped of precise distances and angles. This is quite natural; when ont 
thinks of going home from the office, one considers primarily the topology of the hallways, footpaths, aad roads to be followed, not precisely how long each 
is, nor the consequences of drifting from side to side — that is too low a level of detail to be considered before setting out along the chosen roate. 

The three primary KAs used to plan paths are shown in Figures 4, 5, and 6. Given knowledge of the start- and end-points, they first select some intermediate 
point. They then repeat the process for the two resulting subpaths until all paths are reduced to straight-line trajectories along single hallways. Although 
it is not the planner we would advocate for more general route planning, it is quite sufficient for our purposes. Indeed, the top-level route planning is 
probably the simplest aspect of the navigation task. 

The topological information needed for route planning is stored in the system data base as a set of facts (beliefs) about how wings, hallways, and rooms 
are connected. These include facts of the form (conn jl kl j4 direct) (hallway jl is connected to hallway kl DIRECTLY via hallway j4 rather thaw 
indirectly via yet further connections), (in-wing jl jwing) (hallway jl is in wing j wing), and (in-hall *j225 jl east 14) (room q22S is is hall jl am 
the east side of the hall, fourteen rooms from the end). A typical plan constructed by the path-planning KAs is shown in Figure 3. This plan was formed 
to satisfy the goal of reaching a target room (cj270 in wing j2) from the robot’s present location (ej233 in wing jl) and was produced in less than a s ec ond. 
No further predictive planning is required for the robot to negotiate the path. 


(follow-path hall q233 jl) 
(follow- path ball jl j4) 
(foUow-path hall j4j2) 
(follow-path hall j2 *270) 


Figure 3: Route from ej233 to ej270 

It is important to emphasise that, even during this relatively short planning stage, the robot remains continuously reactive. Thus, for example, dkould the 
robot notice indication of a jet failure on the space station, it may well decide to interrupt its route planning and attend instead to the task of remedying 
the jet problem. 

5.2 Reactive, Goal-Directed Behavior 

The KAs used to navigate the route fall into three claaes; those that interpret the path plan and establish intermediate target locations, thorn that are 
used to follow the path, and those that handle critical tasks such as obstacle avoidance and reacting to emergencies. Each KA manifests a setfcootaiord 
behavior, possibly including both sensory and effector components. Moreov er , the set of KAs is naturally partitioned according to level of functionality (c C 
[7]): low-level functions (emergency reactions, obstacle avoidance, etc.), middle-level functions (following already established paths and trajectories), and 
higher-level functions (figuring out how to execute a topological route). All of these KAs are simultaneously active, performing their function whenever 
they may be applicable. Thus, while trying to follow a path down a hallway, an obstacle avoidance procedure may simultaneously cause the robot to veer 
slightly from its original path. 

Once a plan is formed by the route-planning KAs, that plan must be converted into some useable form. Ideally, the plan shown in Figure 3 should be 
represented as a procedural KA containing the goals “leave room ej233 and go into hall jl,” “go to the j I -j4 junction,” etc. Since it is not currently possible 
for KAs to create or modify other KAs, we have, instead, defined a group of KAs that react to the presence of a plan (in the data base) by translating it 
into the appropriate sequence of subgoals. Each leg of the original plan generates subgoals such as turning a corner, travelling the hallway, and updatiw* 
the data base to indicate progress. The second group of navigation KAs reacts to these goals by actually doing the work of reading the sonars, ■fterpretiag 
the readings, counting doorways, aligning the robot within the hallway, and watching for obstacles ahead. 

For example, consider the KAs in Figures 7 and 8. After having planned out a path as directed by the KA in Figure 2, the robot is given a goal of the form 
( ! (rooa-lsft tfroow)) (the variable tfroow will be bound to some particular constant representing the room that the robot is trying to leave). The 
KA in Figure 8 will respond and actually perform steps for leaving the given room. The last step in this KA will insert a fact into the system database of 
the form (origin tfroow tffcall) (again, the variables will be bound to specific constants). This fact alerts a path interpretation KA (depicted in Figure 
7) that the robot is now ready to execute a leg of its path, and supplies the KA with the robot’s starting position (i.e., the room adjacent to the robot, 
tfroow. and the hall in which it stands. If hall). Assuming that the facte describing a path have been placed in the database (for example, the set of facte 
in Figure 3), the fact-invoked FIMD-IEXT K A in Figure 7 will reapood and begin to interpret the path. It will then proceed and travel down the hallway aw 
instructed. This will in tom establish a new origin position, thereby allowing for the next step of the path to be executed. 

A third group of KAs reacts to contingencies observed by the robot as it interprets and executes its path. For example, these will include KAs that respond 
to the p r e s e n ce of an obstacle ahead (see Figure 9) or the fact that an emergency light has been seen. These reactive KAs are invoked solely on the basis 
of certain facts becoming known to the robot. Implicit in their invocation, however, is an underlying goal to “avoid obstacles* or “remain safe.* 

Since a fact-invoked KA can be executed as soon as its triggering facts are known, the KAs invoked by these contingencies can interrupt whatever else 
is happening. Of course, this may not always be desirable. Ideally, domain-specific metalevel KAs should determine whether and when preemption in 
desirable, but, at this stage of the project, we have not used metalevel KAs betides those provided as PRS defaults (which give immediate preemption). 
An alternative to preemption is to send a contingency message to another PRS instantiation that can process that message in parallel. 


5.3 Parallelism and Mediation 

Because of the real-time constraints and the need for performing several tasks concurrently, it is desirable to use multiple instances of PRS running in 
parallel. In particular, parallelism can be used for handling contingencies without interrupting other ongoing tasks. Multiple PRS instantiations can also 
be used as information filters to protect other instantiations from a barrage of uninteresting sensory information. (The need for such filters anses in many 
problem domains - for example, in monitoring sensors on the space shuttle [4).) The strongest reasons, however, have to do with the inherently parallel 
and largely independent nature of the various computations that most be performed in dynamic environments. 

For example, as the robot roils down a hallway, it fixes its sonars to determine how far it is from the walls, and also to count doors. Suppose it derides that 
the walls are too dose and a change in course is warranted. Because speed changes cannot be accomplished instantaneously, changing coarse may take aw 
long as two seconds. This is long enough for the robot to roil past a doorway. If the procedure that monitors sonar readings is interrupted to effect the 
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Figure 6: Path Planning KA 
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Figure 7: Plan Interpretation KA 
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Figure 9: Reactive KA 
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course change, the robot might completely miss a door reading. Conversely, delaying course changes for the sake of sonar monitoring could make a collision 
with a wall inevitable. Of course, travelling at lower speeds would solve the problem, but would also render the robot too slow to be useful. 

The moat effective way to handle this problem is to allow multiple PRS instantiations to execute concurrently. Running several instantiationsaayncbronously 
has its own problems, however. For example, it is desirable to have one PRS instantiation devoted to the taak of keeping the robot in the center of the 
hallway, with another driving the robot to the target location and adjusting speed appropriately (e.g., slowing down when approaching the target). Changes 
in course are effected by changing the relative velocities of the two wheels, depending on their current velocity, and changes in speed by changing the 
accelerations of the wheels. The problem is that, if both tasks need to be performed at once, the required wheel operations may interfere with one another. 
This is an interesting example of a situation in which domain-independent decomposition operators will not work - because of the real-time constraints of 
the problem domain, it is not suitable to achieve one goal (say, a change in direction) and subsequently achieve the other (change in speed); neither can 
each goal be achieved independently, as the means for accomplishing these goals interact with one another. 

To mediate between interacting goals, we chose to implement a third PRS capable of accepting both speed and direction change requests asynchronously. 
This PRS could be viewed as a virtual controller. Because the virtual controller is in complete control of the wheels, it can issue instructions that achieve 
both kinds of requests at once. In this respect, it serves as a special-purpose solution to a particular kind of conjunctive goal; goals to change both speed 
and direction are decomposed into independent goals to change the left and right wheel speeds. 

Related to the problem of interacting goals is that of goal conflict; just as one may have possibly conflicting beliefk about a situation that need to be 
resolved (the problem of situation assessment), one may also have conflicting goals (or desires) that need mediation [18]. For example, the virtual controller 
discussed above often gets conflicting speed requests from KAs: the hallway traversal KA might request that a certain velocity be maintained, the KA that 
delects approach of the target location may request a decrease in velocity, and the KA that detects obstacles could request that the robot stop altogether. 
At the same time, other KAs might request changes in direction to stay in the center of the hall or to pass around small obstacles. 

To resolve these conflicting goals, the virtual controller has to be able to reason about their urgency and criticality. This, in turn, may involve further 
communication with the systems requesting these goals. Our present solution is to define domain-dependent mediators where necessary, but. at present, no 
general approach to this problem has been attempted. 

5.4 Coping with Reality 

Our initial implementation of the robot application used multiple PRS instances interacting with a robot simulator, all running on the Symbolics 3600 
This worked well, and demonstrated the suitability of the system for controlling complex autonomous devices. That don*, we began work on driving the 
real robot. This transfer took considerably longer than estimated. Two major problems caused this divergence between expectations and reality. 

First, because PRS was implemented on a Lisp machine, interaction with Ftakey was confined to occur via an ethernet cable. Software for remote procedure 
calls over the ethernet limited communication to 15 function calls per second - too slow for timely response to sensor input. Consequently, we were forced 
to transfer much of the functionality of PRS to Flakey’s SUN. This required translating the functionality of the lower-level KAs into C code, as well m 
explicit coding for message and clock-signal handling. Unfortunately Flakey's operating system also did not support interprocess communication at the 
bandwidth and efficiency we wanted. This forced us to implement communication through shared memory, with all the concojnmitanl synchronisation 
code needed. After these efforts, the information flowing over the ethernet was at the level of “move N doors" (PRS to Flakey) and Tm stopping for aa 
obstacle" (Flakey to PRS). Obviously, the translated system is no longer solely constructed from instances of PRS. As a result, our final implementation ■ 
considerably more constrained than the simulation version in its ability to reason about its low-level actions and to react appropriately to changing goals 

The second obstacle to translating from our simulated application to the one that could function in the physical world is the nature of the real world itself 
A realistic environment is simply not controlled enough to foster efficient debugging. It is hard to repeat experiments (and get the same bup), time delays 
become critical, and the behaviors of real sensors and effectors can differ significantly from simulated ones. 

The configuration of our current application system is shown is Figure 10. Three machines are involved, a Symbolics 3600, a SUN, sod s Z80, running 
six application processes. The wheels and sonars are also depicted, and may be regarded as physical processes. The rectangular box represents the SUN*s 
shared memory area; arrows represent interprocess communication. 



Figure 10: Processes Used in the Implementation 


6 Discussion 

The primary purpose of this research was to show that the BDI architecture of PRS, the partial hierarchical planning strategy it supports, and its metalevd 
(reflective) capabilities could he effective in real-world dynamic domains. Furthermore, the design of PRS .nerts some of the more important desiderata 
for autonomous systems: modularity, awareness, and robustness [18]. In this section, we will briefly compare our approach to other work in the areas '/ 
planning and robotics. 
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The partial hierarchical planning strategy and the reflective reasoning capabilities used by PRS allow many of the difficulties associated with traditional 
planning systems to be avoided, without denying the ability to plan ahead when necessary. By finding and executing relevant procedures only wbea sufficient 
information is available to make wise decisions, the system stands a better chance of achieving its goals under real-time constraints. 

For example, the speed and direction of the robot is determined during plan execution, and depends on such things as proximity of obstacles and the actual 
course of the robot. Even the method for determining this course depends dynamically on the situation, such as whether the robot is between two hallway 
walls, adjacent to an open door, at a T-intersection, or passing an unknown obstacle. Similarly, the choice of how to normalise fuel or oxidant tank pressure 
while handling a jet failure depends on observations made during the diagnostic process. 

Because PRS expands plans dynamically and incrementally, there are also frequent opportunities to react to new situations and changing goals. Tbs system 
is therefore able to modify its intentions (plans of action) rapidly on the basis of what it currently perceives is well as upon what it already believes, intends, 
and desires. For example, when the system notices a jet- fail alarm while it is attempting to fetch a wrench, it has the ability to reason about the priorities 
of these tasks, and, if so decided, suspend the wrench* fetching task while it attends to the jet failure. Indeed, the system even continues to monitor the 
world while it is route planning (in contrast to most robot systems), and this activity too can be interrupted if the situation so demands. 

The powerful control constructs used in PRS procedure bodies (such as conditionals, loops, and recursion) are also advantageous. As a result, the robot 
can display behaviors of the form “do X until B becomes true." When X is "maintain speed at 400mm/sec" and B is “N doorways have been observed’ we 
see why we could dispense with coordinate grids and dead reckoning: we could define the robot's behaviors in terms of conditions that changed over time. 
In contrast, classical planning systems often have difficulty in reasoning about such behavior and are thus restricted to using unchanging features such as 
fixed locations or distances. 

PRS is also very robust in that there may be many different KAs available for achieving some given goal. Each may vary in its ability to accomplish the 
goal, and in its applicability and appropriateness in particular situations. Thus, if there is insufficient information about the current situation to allow one 
KA to be used, some other * perhaps less reliable - KA may be available instead. For example, if a topological map of an area is unavailable for planning 
purpoees, the robot need not be rendered ineffective - there may, for example, be some other KA that seta the robot ofT in the general direction of the target. 
Parallelism and reactivity alao help in providing robuatneaa. For example, if one PRS instantiation is busy planning a route, lower-level instantiations can 
remain active, monitoring changes to the environment, keeping the robot in a stable configuration, and avoiding dangers. 

The system we propose alao meets many of the criteria of rational agency advanced in the philosophical literature on practical reasoning (e g., see the 
work of Bratman [5)). Driven by the demands of explaining resource- boundedness and inter- and iura-agent coordination, recent work in the philosophy 
of action has moved beyond belief-desire architectures for rational agents and has provided insights into the nature of plans and intentions, and especially 
the nature of intention formation. 

In particular, plana are viewed aa being subject to two kinds of constraints: consistency constraints and requirements of means-ends coherence. That is, an 
agent's plans need to be both internally consistent and consistent with its beliefs and goals. It should be possible for an agent’s plans to be succemfully 
executed (that is, to achieve the more important goals of the system) in a world in which its beliefs are true. Secondly, plans, though partial, need to be 
filled in to a certain extent as time goes by, with sub-plans concerning means, preliminary steps, and relatively specific courses of action. These subplans 
must be at least aa extensive aa the agent believes is required to successfully execute the plan; otherwise they will suffer for means-ends incoherence. 

These constraints on the beliefs, desires (goals), and intentions of an agent are realised by the system proposed herein, and as such it can be viewed aa an 
implementation of a rational agent. In addition, the notion of intention we use meets the major requirements put forward by Bratman (5), who considers 
intentions to have the following properties: 

• They lead to action, 

• They are parts of larger plans, 

• They involve commitment. 

• They constrain the adoption *1 other intentions, 

• They are adopted relative to the beliefs, goals, and other intentions of the system. 

Of course, our system is far from manifesting the behavioral complexity of real national agents; however, it is a step in the direction of a better understanding 
of rational action. 

In contrast to most A! planning work, research in robotics has been very concerned with reactivity and feedback (2,18.23). However, most of this work 
has not been concerned with general problem solving and commonsense reasoning - the work is almost exclusively devoted to problems of navigation and 
execution of low-level actions. Furthermore, the reactivity is not of the general form we advocated above; although the systems can adjust the mesas for 
achieving given goals depending on incoming sensory information, they do not exhibit the ability to completely change goal priorities, to modify, defer, or 
abandon current plana, or to reason about what is best to do in light of the current situation. 

Recently, Brooks [7] has advanced some intriguing ideas concerning the structure of autonomous systems. Rather than the horizontal structure typical of 
most robot systems (where lower levels are restricted to performing basic sensory and effector processing, and the higher levels to planning and reasoning) 
Brooks advocates a vertical decomposition in which distinct behaviors of the robot are separately realized, each making use of the robot's sensory, effector, 
and reasoning capabilities as needed. 

Similarly, PRS provides a vertical, rather than horizontal, decomposition of the robot task domain. Each KA defines a particular behavior of the system, 
and can involve both processing of sensory information and the execution of effector actions. For example, there is a KA that manifests a behavior to 
remain clear of obstacles, another KA whose behavior corresponds to keeping a straight course in a corridor, and yet another that chooses and traverses 
routes from one room to another. All these KAs use both sensors and effectors to greater or lesser degrees - there is no single subsystem that preprocesses 
the sensory data before sending it to the reasoning system, and there is no post-processing of plan information that determines actual effector actions. 

In this sense, our system is very similar in structure to that proposed by Brooks - indeed, it can claim the same positive benefits (8): 

s There are many parallel paths of control through the system [many different procedures can be used in a given situation) Thus the performance of 
the system in a given situation is not dependent on the performance of the weakest link in that situation. Rather, it is dependent on the strongest 
relevant behavior for the situation 

• Often more than one behavior is appropriate in a giv»*n situation. The fact that the behaviors are (can be] generated by parallel systems [multiple PRS 

instances) provides redundancy and robustness to the overall system There is no central bottleneck [through which all the processing or reasoning 
must occur] # 
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• With 90tM dincipUM in structuring Um daeompoaitioo, tht Individual taak-ach Wring behavior* cm run on MparaU piece* of hardware. Thus [the 
design] Wed* to a natural mapping of Um compWt* inUUiftnt system onto a parallel machine. The benefit* are threefold: (1) redundancy again; (2) 
ep endup; and (3) a naieraJly extensible system. 

The main difference between onr syatem and that advanced by Brooke ie that we employ a much more general mechaoiem for ejecting between appropriate 
behaviora than he doee: whereee Brooks uses inhibitory and excitatory links to integrate the aet of behaviors defined by each of the system's functional 
components, we use general metalevel procedures and communication protocols to perform the selection and integration. Of course, each generality will 
likely preclude meeting some of the real-time constraints of the environment, in which case the metalevel procedures might need to be compiled into a farm 
closer to that envisaged by Brooks. Similarly, while our system naturally maps onto eeerrc-grsis parallel machines, sophisticated compilation techniques 
would be required to map the lower-level functions onto highly parallel architectures. 

Currently, PRS does not reason about other subsystems (i.e., other PRS instantiations) In any but ths simplest ways. However, the m es sa ge -passing 
mechanisms we have employed should allow us to integrate more complex reaming about interprocess communication, such aa described by Cohen and 
Levesque [9). Reasoning about process interference and synchronisation is also important where concurrency is involved. The mechanisms developed by 
us for reeeoning about these problems [12,13,14,19,30,31] could also potentially he integrated within PRS. Our future research plana include both work on 
communication and synchronisation within the PRS framework. 

Finally, in giving a description of the PRS architecture, it it important to note that the actual •mp/cfnrnfefWa of PRS ia not of primary concern. That 
is, while we believe that attributing beliefs, desires, and intentions to autonomous systems can aid in specifying complex behaviors of these systems, and 
can assist in communicating and interacting with them, we are not demanding that such systems actually be structured with distinct data structures that 
explicitly represent these psychological attitudes (although, indeed, that ia the way we have chosen to implement our syatem). We caa instead view the 
specification of the PRS syatem, together with the various metalevel and object-level KAs, simply as a description of the desired behavior of the robot. 
This description, suitably formalised, 1 could be realised in (or compiled into) any suitable implementation we choose. In particular, the beliefs, desires, and 
intentions of the robot may no longer be explicitly represented within the system. Some interesting work on this problem is being carried out currently bv 
Rosenschein and Kaelbling [26] 


7 Limitations 

The primary thrust of this work has been to evaluate an architecture for autonomous systems (hat provides a means of achieving goal-directed, yet reactive, 
behavior. We have made enough progress to show that this approach works. However, the research is only in its initial stages and there are g number of 
limitations that still need to be addressed. 

First, there is a class offsets our current system must be told; for instance, the robot's starting location. If the robot is initialised in some unknown position 
on the topological map, the planner will abort. It would be straightforward to solve these problems by including KAs that ask for the missing information, 
hut a completely autonomous recovery would be a much more challenging problem. Possible approaches might involve exploration of the terrain (including 
movement around the neighboring area) and pattern matching onto known topological landmarks. , 9 

Second, there are many assumptions behind the procedures (KAs) used. For example, we have assumed that hallways are straight and corners rectangular; 
that ail hallways are the same width and have that width for their entire length (except for doorways and intersecting hails); that there is only one layer 
of obstacles in front of any wail (nowhere is there a garbage can in front of a cupboard); that all doors are open and unobstructed; and that other objects 
move much slower than the robot. 

We have also made assumptions that limit the robot's reactivity. For example, we assume that the robot does, in fact, arrive at the junction it planned to 
reach. If the robot miscounts doorways, it will stop in the wrong place, turn, and start the next leg of its journey without realising iu mistake The result 
is generally that the robot will be found begging a wall to "please make way." If the robot realised it waa in the wrong position, it could replan to achieve 
its goals. However . because we assume that the door count is always right, the route planner ia never reinvoked. 

In addition, some goals are not made as explicit as one would like, but are implicit in the KAs used by the robot. For example, the robot ia designed 
to move as fast as possible without miscounting doorways and to travel along the center of the hallway while accepting the fact that this ideal will 
rarely he achieved. Such goals are not represented explicitly within KAs. Handling the first kind of goal (“move as fast as possible") would be relatively 
straightforward, requiring simply that axioms relating robot speed and perceptive capabilities be provided to the system. However, it is not obvious how 
to explicitly represent the second kind of goal, in which the system attempts to maintain a particular condition but expects at best only to approximate it. 

Finally, increaaed parallelism would have been preferable, allowing the robot to perform more tasks concurrently. For example, we could have included many 
more low- level procedure* for. **y. avoiding dangers and exploring the surroundings. This would have provided a much more severe test of the system’s 
capability to coordinate various plana of action, to modify intentions appropriately, and to change its focus of attention. 
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Robots must be able to ruuction in the real world. >The real world involves processes 
and agents that move independently of the actions of the robot, sometimes in an unpredictable 
manner. T h is pap e r pr ese n ts I real-time integrated rojRe planning and spatial representation 
system for planning routes through dynamic domains/ The system will find the safest most 
efficient route through space-time as described by a set of t£er defined evaluation functions. 
Because the route planning algorithm is highly parallel and can run on an SIMD machine in 0(p) 
lime (p is the length of a path), the system will find real-time paths through unpredictable 
domains when used in an incremental mode. Th is p ap er w i l l dis cu ss: Spatial representation, an 
SIMD algorithm for route planning in a dynamic domain, and results from an implementation on 
a traditional computer at chi lecture c — *+**~x^ , 

1 - Introduction: Route Hanning in Dynamic Domains 


The ability to represent and plan movements through space is necessary for any autonomous 
mobile robot. Mechanical error and uncertainty make it impractical to maneuver a robot through a 
series of complex tasks strictly by dead-reckoning. If dead-reckoning is of limited use, then some 
navigation capabilities must be brought into play; navigation depends on having some knowledge of 
the world outside of the robot. Towards this end, a wide variety of spatial representation systems 
have been developed in recent years. 

A variety of techniques have been used to attack different aspects of the spatial representation 
problem. Topological graphs lLaumoud83|, lChatila83] have been used for guiding route planning 
through a loosely connected set of convex polygons representing free-space areas in an indoor 

environment. Regions mapped with traversable conduits lMcDermou84] have been used successfully 
for large scale navigation in uncertain environments. Representation of the exteriors of obstacles as 
the edges of a highly connected graph was used by Davis, allowing detailed knowledge of the 

environment and its accompanying uncertainty to be captured (Davi$84). 

Other representations have been used Tor capturing movement or navigational details necessary 
for a robot to plan its nctivilcs. Configuration space ILozano-Perez83) provides a computationally 
tractable approach to calculating the practical steps for moving a robot from one position to another. 
Using Voronoi diagiams and representations of Tree space, movements in three-dimensions have been 
calculated to maintain a robot the maximal possible distance from any obstacle [Brooks82j. Similar 

methods, when combined with an analysts of the robots sensors, can calculate a path that is both 

relatively safe and easy to navigate | Millet 85 J. 

4; j 41 


im '(T INTFWf HW*UY RttMl 


Despite the variety of the techniques mentioned above, a!) of the systems discussed share some 
basic limitations. None of the systems takes into account the quality of the surface upon which the 
robot travels, relying on the surface being either traversable or not. Such a restricted view is 

continually contradicted by the way people move about. People stray off the sidewalk or jay-walk 
across a street whenever it is convenient and safe, hence a realistic robot should be able to behave 
similarly. A further limitation is that all the aforementioned systems are designed to operate under 

static conditions, where the only aspect of the world that changes is the position of the robot. This is 
an unrealistic and unacceptable limitation for almost all applications. 

In addition to being able to function in a dynamic world, a robot should be able to reason about 

dynamic processes and how they may affect it. For example, if a robot knows the local train schedule 

and needs to get to the other side of the train tracks, then it should use that information when 

planning to get to the other side. If the robot has information predicting that a long freight train will 

be coming just before it can reach the tracks, then given the choice between a short path that involves 

crossing the train tracks, and a slightly longer plan to go under the tracks, the robot should choose 
the latter plan. Similarly, if the robot's task is to rob a train, then the ability to plot a path that will 
allow the robot to jump onto the moving train is necessary. 

Unpredictable dynamic processes must also be taken into account during route planning. A 
cavalry robot that "fears* an attack by a tribe of Indian robots would be belter off planning to get to 
the fort across the open plain, rather than passing through the narrow passageway of Ambush Canyon . 
Hie primary reason for this is an attack in the canyon would more effectively block the robot from 
its destination, thereby mandating backtracking. 

The single property that most distinguishes this work from previous systems is that it models 
not only space, but time as well. Rather than making a calculation about whether the robot can 
traverse a particular area independent of time, this system models the ability of the robot to 

traverse that area at different times. We have accounted for temporal as well as spatial changes in 
* the environment. The message passing technique used allows time to be considered while allowing 
the system designer to muuct qualities of the domain, such as the cost of moving front one position to 
another and the ease of traversing a particular area of space. The remainder of this paper describes a 
representation and route planning system for use in unpredictable dynamic domains. 

2 - The Algorithm 

This section will describe an algorithm that finds the best path through a predictable 
it-diincnsional space using user-defined evaluation functions. The algorithm provides for an 
effective representation of space-time and the ability to functionally define predictable static and 

dynamic objects that map into a subset of a given space-time. 

2.1 - Spatial Representation 

A path finding algorithm that is of any use must provide for: I) an effective representation of 

space, 2) the relationship between the elements making up the space, and 3) the ability to define the 
quality of that space with respect (o a robot using the generated plans. 

To cflcctivly represent space, this model uses uniformly shaped, ^dimensional hyper-cubes 
called "nodes". Each node represents a small chunk of space. Arbitrarily shaped n-dimensiona! 
aicns can be defined through the spatial concatenation of nodes along common (n-l dimensional) 
surfaces. The collective area occupied by the nodes is called "space*, while the remainder of 

existence is referred to as "void*. For example. Figure I shows an arbitrary two-dimensional space 
constructed from the spatial concatenation of square shaped nodes. In general, the size of a node wtll 
be .of at least sufficient size to subsume the size of die robot. 



Figure 1 

42 



The relationship between adjoining nodes (such as the ability to move from one node to another) 
is represented by unidirectional links between each of the nodes (see Figure 2b). The reflexive 
relationship a node has with itself is represented as a link that points back to the node and 
represents the ability to remain at that node. Associated with each of the links is a cost. The cost 
represents such things as: whether the surface between the two nodes is continuous, a downgrade, in 

three-dimensions directly below, or if a node represents a safe place to stop. The function of the 
links is to provide a communication path over which messages can be sent. A node can have up lo 2n 
communication links with its neighboring nodes and one reflexive communication link. Nodes that lie 
on the edge between space and the void will have fewer communication connections. 

The topological features of the space are represented by the cost on links between nodes, 
whereas the actual traversability of a particular region of space is specified by the node's 
traversability constant. The traversability constant represents the relative ability of the robot to 

move over a given node. For example, for a robot with wheels, a concrete floor would have a 

traversability constant near 1.0 while that of quicksand might be on the order of 0.001. 

Consider the spatial representation of the two-dimensional space shown in Figure 2. Part a of 
the Figure shows the physical layout of the example space. Fart b shows how the nodes that define 
the space arc interconnected with one another. The reason for the missing links between some of the 
nodes is that the cost associated with the link is infinite (the cost on other links is not indicated). 

As a result, no paths arc generated that make those transitions. While the suifncc is 
three-dimensional, only a two-dimensional representation is necessary (for this particular example) 

because the costs on the links between nodes allows for the representation of hills, absorbing the 
three-dimensional aspect of the space. For example, a link going down hill could have a lower cost 
than one going tip lull. If the features described above arc taken together, a robust representation of 
(lie salient features of space can be created. 



part a part b 

Figure 2 


2.2 - Representing predictable objects 

For a route planning system to be of value, not only does it have to represent the salient 
features of space, but it must also provide an effective means for representing predictable objects. 
Predictable objects arc those objects, both sialic and dynamic, that have fully predictable behavior in 
both lime and space. An object is represented as a function having the defined space and lime in 

its domain, and some subset of the nodes that make up space in its range. The set of nodes generated 

oil any application of the function consists of those nodes in the given space that are occupied (fully 
or partially) by the object during the given time. For example, consider a model of a simple 
two-dimensional revolving door. This can be represented by defining a function that has four nodes 
forming a square in its range. Then, by setting the function to map onto two of the nodes that arc 
diagonal during odd lime units and onto the other two nodes during even limes, a simple, predictable, 

revolving door can be created. Such functions can be encoded into each of the nodes at setup time, 

thus reducing the amount of outside communication during operation. 
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2.3 • An S1MD Algorithm Tor Route Planning 


The rcpicsentation of the spatial features and predictable objects thus far described provides a 
basis for an algorithm that can be directly implemented on an SIMD machine, such as that in 
lllil!is85). This is accomplished by assigning each node to a processor. Each processor has message 
communication links to other processors that transcribe directly from the node links. A message 
represents the value of a possible transition from one node to another and the quality of the entire 
path leading up to that transition. To perform the task of reclaiming, the generated paths from the 
processors, each processor must maintain a stack. The stack represents a storage place for logging 
the history of the activity at the processor. For now, the simplifying assumption will be made that 
each of the moves made by the robot being simulated will take one unit of lime. For example, the time 
required for the robot to move from one node to an adjacent node, takes one unit of time regardless of 
lire robots previous state. The removal of this assumption will be discussed later (see additional 
features section 2.5). 

Using a synchronous, step-wise process of passing messages from processor to processor, all 
possible paths that the robot could take through time and space iit attaining the desired, static 
destination location can be considcicd. The process has two phases and a terminating condition. 

The first phase sets up the initial message set. Using the node in space that represents the 
current location of the robot, a set of messnges is created, one message for each communication link 
associated with tltc node. Each message has associated with it an energy value that reflects the cost of 
moving the robot to the space represented by the adjoining node. The particular value of a message’s 
energy is detet mined by a user-defined evaluation function. The evaluation function considers such 
things ns: the cuircnt slate of the rolx>t, the cost on the link that the message is to be sent over, the 
navcisability oT the node cuircntly being occupied by the robot, etc. The set of messages is then sent 

to its icspcctivc destinations along the communications links of the node. 

The second phase is the operational phase. It is defined by having each nod** in space that is 
not occupied by an object during the current simulation time perform the following process in a 

synchronous manner: form the base message by setting it to the incoming message with the minimum 

energy. All other messages can be thrown out because they represent more costly paths that attain 
the same location in space time. In a manner similar to that described in the first phase, a new set of 
messages is created. Each message in the new set is assigned an energy that is a function of the base 
message and (lie link over which the message is intended to travel. The base message is then tagged 
with the time and a pointer indicating the node that created it. The base message is (hen added (o the 
node's stack. Finally, the node sends the newly created list of messages out along their respective 
communication link. This process is repeated, until the termination condition is met, each repetition 
representing a subsequent time unit. 

The terminating condition is defined as the state of the system when the energy associated with 
each of the messages currently being processed in tltc system is greater than the global bound. Ihc 
global bound is the minimum energy for all the messages that have readied the destination node 
(similar to zorch decay in |Charnink86|). 

After the ending condition is met, the path through space-time dial has the lowest energy 
associated with it can be retrieved from the destination node. This is done by locating the message on 
ihc destination node's stack with the lowest energy value. Once this is done, die path can be 
obtained by following the pointers back through space (oilier processors) and time ( the slacks 
associated with the processors) until the robot's original location in space-time is encountered. The 
stack allows imcrproccssor communications to be kept to a minimum. 

2.4 - A Predictable Example 

This section shows how predictable dynamic objects are represented and anticipated. Shown 
below in Figure 3 is a two-dimensional space made of ramps that are to be navigated by the robot. A 
dynamic object enters the world at time t = 8 and leaves the world after 1=15, indicated by the 
blackened square in the second of the two Figufes. While the object is present, the node it occupies 
cannot be traversed by the robot. The nodes over which the robot must travel are, for the most part, 
easily traversable and thus have a high Iraversability constant of 0.9. Two of the nodes, however, are 
made of loose sand and have (he low iraversability constant of 0.2. 




Suie of the world for timet 


State of the world for timet 
0<*t< 8 tnd 15 < t 

Travembility Factor - 0.9 
Travenability Factor • 0.2 

Location of dynamic object 


R ■ Robot's Original Location 
D - Robot's Destination Location 


Figure 3 


The route planning objective is, starting at t*0, to move the robot from its current location 
(indicated by the R) to the static destination location (indicated by the D) along the space-time path 
of minimum energy without going over a cliff. By counting the number of node transitions that must 
be made in traveling front R to D, one can determine that the shortest path the robot could lake would 
be 13 moves long and would use 13 time units. But, when the 8th move is being made, the dynamic 
object enters the space and disables the indicated node, keeping it from receiving or sending 
messages. The key is that after seven time units, the message passing process has not propagated to 
the node with the dynamic object in it. This causes all message activity to be confined to the back 
portion of the defined space until after t = 15. The blockage is due to the fact that all paths from R to 
D must pass through the node (hat is occupied by the object. After 1=15 units, the occupied node is 
freed and resumes processing and passing messages, eventually allowing the system to meet its 
terminating condition. 

A number of points can be made here that are based on the choice of the evaluation function 
used to determine the message energies. One is that a proper evaluation function will allow the best 
path to avoid the nodes that arc made of sand by giving a high energy value to any message that 
represents a transition into such a node. A more important point is that the evaluation function 
determines how and where the robot spends its time while it waits for the object to leave. ! : or 
example, the robot could remain at R, charging its batteries, wander along the path, or hurry lo the 
object and wail there until the dynamic object goes away. So, depending on the requirements and the 
situation (c.g., tnaximi/.c charge time), an evaluation function can be written to dclctminc how and 
where the robot wails (c.g., add a cost for stopping and starting, or time spent not charging). 


2.5 - Additional Features 

Some of the system's most powerful features have been omitted thus far for clarity. Among 
these features are: The ability to describe the destination as a function of both space and time, the 

ability to consider the openness of a node with respect to its spatial location, and the ability to 

accurately ctfhsidcr the movement capabilities of the robot using the generated plans. 

The ability to describe the destination as a function of both time and space allows the system 
to solve problems involving alternative planning (e.g. if you can't gel to the post office by five, go to 
the diug store for the stamps) and problems involving coordinating actions with dynamic objects (c.g. 
jumping on a moving train). This ability is incorporated into the system, by modifying the 

termination condition of the algorithm to consider a time ordered set of possible destination nodes. 

The ability to consider the openness of a node as a spatial relation between it and (he nodes 
surrounding it can be used to generate paths that avoid narrow passages, if possible. Gcncr^d plans 

avoid moving the robot through paths that would become blocked if an unpredictable object were to 

be encountered during the execution of the plan. 

’flic following gives an example of how an openness function might be defined iteratively for a 
two-dimensional space. First, assign each node in space a value of one if it is not occupied and a 
value of zero if it is occupied. Second, have each node send its value along all of its communication 

links. Third, each node creates a temporary value by summing the values of all incoming information 

and then integer-dividing them by I the first iteration and 2,4,8,16, ... in each subsequent iteration. 
Lastly, if the temporary value is greater than zero, a new node value is set by multiplying the 




Ihc nodes will eventually converge to a stable node value pattern. The pattern will be such that the 
nodes that are in the biggest, most spacious areas will have the highest values, and the nodes in 
corners or alcoves will have the lowest values. The addition of openness to nodes allows evaluation 
functions to be written that considers the trade-offs between short path length and increased chance 
of backtracking due to the chosen route becoming blocked by an unpredictable object. 

The ability to effectively represent the time required by a robot to make simulated moves 
allows plans to be generated that lake full advantage of the robot’s abilities. For example, moving 
from rest to another node should take longer than moving from one node to another when the robot is 
already moving in the desired direction. This is significantly different from the scheme used up to 
this point, where all moves were considered to take one unit of time. The ability to consider the 
capabilities of the robot in the generated paths has been incorporated into the model by setting the 
model to operate in a more asynchronous manner. This asynchrony is accomplished by associating a 
real-time with each message. The time value of created messages is set by adding to the time in the 
incoming message the amount of time that is required for the robot to make the move represented by 
each of the new messages. The ability to effectively predict the performance of the robot is bounded 
by the precision with which the real-time actions or the robot moving through space-time can be 
modeled. 

3 - Dynamic vs. Unpredictable 

Thus far, only the generation of plans that involve predictable objects has been considered. To 
move autonomous robots in the real world, a route planning system must be able to handle the 
unpredictability that the real world has to offer as in the case of a robot that must walk across a busy 
street. The process of incremental route planning has been identified to handle this problem. 

Incremental route planning can be viewed as the repeated use of a route planner that executes 
in a predictable dynamic environment. After each step, the state of the world is tested and updated 
with any new information, for identification of any unpredictable objects. Incremental route 
planning is effectively handled by this system because it is structured to operate most efficiently in 
the incremental form. Unpredictability is handled by the system's ability to rapidly calculate the 
next best step after every primitive move the robot makes. 

By making a simple modification, an incremental version of the algorithm has been constructed 
from the framework of the previously defined algorithm for predictable domains. The stack is 
eliminated from each of the processors by making an addition to the messages being passed around 
the system. The modification involves the addition of an initial direction header. This change is 
made because all that is needed is the next best move and not the entire path. The headers, of the 
messages created in phase one of the algorithm, are set to a value representing the link along which 
I hat particular message is to be sent. The header, of the messages created during phase two, is copied 
from the header of the incoming message. To identify the next best move, the terminating condition is 
modified to keep track of the message representing the current global message energy bound. When 
the system halts, the header of the global message energy bound indicates the direction of the next 
best move. This represents a significant simplification or the system, as it eliminates concerns 
involving the potentially unbounded growth of the node stacks. 


4 - Implementation 


Hie algorithm, when fully implemented on an SIMD machine, operates in O(p) time, where p is 
the length of the longest path through space-tinie that is bounded by the global message energy 
bound. 


A simulation version of the algorithm, written in NISP (McDermotl83J. is currently up and 
running on a VAX 11-785. It functions on the examples given plus others involving unpredictable 
dynamic environments. Hie implementation includes software for simulating the SIMD architecture. 
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5 • Further Research 



There are several possible extensions to this model that would increase its representational 
power. Among the most useful are: 

- The granularity of the nodes: It be co m es difficult to ensure that there is an adequate path for 
the robot to traverse, when the granularity of the nodes used to represent space is smaller 
than the size of the robot or the exact size of the nodes is undetermined. 

* Uncertainty in dynamic times: Special concern should be given to route planning where 

objects entering the space are predictable in their behavior but have uncertain arrival times. 
For example, the subway train that is running a few minutes behind schedule. 

- Achieving maximal efficiency over a set of destinations: This is similar to the traveling 

salesman problem. 

- Modeling unpredictable processes: The power of an incremental route planner can be 
increased for a particular domain with some mode! of the typical behavior of the 
unpredictable objects in that domain. For example, the route planner could give more useful 
advice to a robot crossing a street if the system had a model of the speed, maneuverability, 
and direction of travel for the autos traveling the road. 

~ Representation and coordination of multiple robots moving through space-time: For example, 

getting Huey, Duey and Luey to meet in the garden on the east end of Use space ship at 3pm. 

This list represents some of the extensions to our model that are currently under investigation. 
Extensions into more abstract domains, such as general problem solving using slate transition 
graphs, are also under consideration. 



6 - Summary and Conclusions 

Planning robot movement in dynamic environments demands that the dynamic aspects of the 
environment be modeled in at least as much detail as the movements of the robot. We have created a 
representation system that allows dynamic aspects of the environment and performance aspects of the 
robot to be easily modeled. It also integrates this model with a high-performance rouu-planning 
algorithm. This system has been extended into an incremental route planner which can be used for 
real-time tactical planning in unpredictable domains. This system has been implemented in an SIMD 
simulator running on a VAX. 
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Nonlinear planners (i.g., ftfUfcWjfSft are often touted as having An efficiency advantage over linear 
planner^^rgr; ST kTPi3)[*^ The reason usually given is that nonlinear planners, unlike their linear coun- 
terparts, are not forced to make arbitrary commitments to the ordenin which actions are to be performed. 
This ability to delay commitment enables nonlinear planners to solve certain problems with far less effort 
than would be required of linear planners. In t his p a pqTi urin a rgil r that this advantage is bought with a 
significant reduction in the ability of a nonlinear planner to accurately predict the consequences of actions. 
Unfortunately, the general problem of predicting the consequences of a partially ordered set of actions is 
intractable. In gaining the predictive power of linear planners, nonlinear planners sacrifice their efficiency 
advantage. There are, however, other advantages to nonlinear planning (e.g., the ability to reason about 
partial orders and incomplete information) that make it well worth the effort needed to extend nonlin- 
ear methods. Tn this p^p^r, w* «»pr l y^* Va ™ ow ™ >L> *** causal inference that supports reasoning about 
partially ordered events and actions whose effects depend upon the context in which they are executed. 
As an alternative to a complete but potentially exponential-time algorithm, ipajjrovide a provably sound 
polynomial-time algorithm for predicting the consequences of partially ordered events. If the events turn 
out to be totally ordered, then the algorithm is complete as well as sound. \ 

Keyw or ds: ea tta al rea so ning, plannin gr pre d ict ion, nonmonotonic inference, data basA management. 
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In this paper, we are concerned with the process of incrementally constructing nonlinear plana (i.e., plans 
represented as sets of actions whose order is only partially specified). Nonlinear planning [2] has long been 
considered to have distinct advantages over linear planning systems such as STRIPS [8] and its descendents. 
One supposed advantage [10] has to do with the idea that, by delaying commitment to the order in which 
independent actions are to be performed, a planner can avoid unnecessary backtracking. Linear planners 
are often forced to make arbitrary commitments regarding the order in which actions are to be carried 
out. Such arbitrary orderings often fail to lead to a solution and have to be reversed. By ordering only 
actions known to interact with one another (i.e., actions whose outcomes depend upon the order in which 
they are executed) the expectation was that nonlinear planners would avoid a lot of unnecessary work. 

The problem in getting delayed-commitment planning to work is that it is often difficult to determine 
if two actions actually are independent. In order to determine whether or not two actions are independent, 
it is necessary to determine what the effects of those actions are. Unfortunately, in order to determine the 
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effects of an action a it is necessary to determine what is true prior to a being executed, and this in turn 
requires that we know the effects of those actions that precede a. In general there is no way to determine 
whether or not two actions are independent without actually considering all of the possible total orderings 
involving those two actions. 

Planning depends upon the ability to predict the consequences of acting. Past planning systems capable 
of reasoning about partial orders (i.e., nonlinear planners) have either employed weak (and often unsound) 
methods for performing predictive inference or they have sought to delay prediction until the conditions 
immediately preceding an action are known with certainty. Delaying predictive inference can serve to 
avoid inconsistency, but it can also result in extensive backtracking in those very situations that nonlinear 
planners were designed to handle efficiently. 

It our contention that the initial success of Sacerdoti’s NOAH [10] program and the promise of NOAH’s 
style of least-commitment planning has caused researchers to ignore important issues in reasoning with 
incomplete information. The idea of least-commitment planning is not the only reason for building planners 
capable of reasoning about partially ordered events. Most events will not be under a planner’s control 
and more often than not it will impossible to determine the order of all events with absolute certainty. 
Planning systems for realistic applications will have to reason about partially ordered events. 

In this paper, we consider the problem of reasoning about the effects of partially ordered actions. A 
theory for reasoning about the effects of actions (or, more generally, the consequences of events) is referred 
to as a causal theory. We will describe a language for constructing causal theories that is capable of 
representing indirect effects and the effects of actions that depend upon the situation in which they are 
applied. We will describe a series of algorithms for reasoning about such causal theories. All of these 
algorithms are polynomial-time, incremental, and insensitive to the order in which facts are added to or 
deleted from the data base. We show that a particular algorithm is complete for causal theories in which 
the events are totally ordered, but is potentially inconsistent in cases where the events are not totally 
ordered. In [6] we show that the general problem of reasoning about conditional actions is NP-completc % 
and, in this paper, we provide a partial decision procedure that, while not complete, is provably sound. 
What this means for a planner constructing a plan is that the procedure is guaranteed not to misslead the 
planner into committing to a plan that is provably impossible given what is currently known. If the decision 
procedure answers yes, then the conditions are guaranteed to hold in every totally ordered extension of 
the current partial order; if the decision procedure answers no, there is a chance that the conditions hold 
in every total order, but to determine this with certainty might require an exponential amount of time or 
space. 

2 Temporal Data Base Management 

In this section, we consider a particular type of inference system, referred to as a temporal data base 
management system (or TDBMS) [4], that is used to keep track of what is known about the order, duration, 
and time of occurrence of a set of events and their consequences. The user of a TDBMS is allowed to add 
two sorts of information: that which is unconditionally believed and that which is believed just in case 
certain conditions can be shown to hold. The former includes information concerning events that have 
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been observed or are assumed inevitable and information in the form of general rules that are believed to 
govern the physics of a particular domain. 

In specifying conditional beliefs, the user explicitly states what the conditions are, and the TDBMS 
ensures that those beliefs (and their consequences) are present in the data base just in case the conditions 
are met. This is achieved through the use of data dependencies [7]. In a TDBMS, the primary forms of 
data dependency (in addition to those common in static situations) are concerned with some fact being 
true at a point in time or throughout an interval. In addition, there is a nonmonotonic form of temporal 
data dependency concerned with it being consistent to believe that a fact is not true at a point in time or 
during any part of an interval. These forms of temporal data dependency are handled in the TDBMS using 
the mechanism of temporal reason maintenance [4]. Language constructs are supplied in the TDBMS that 
allow an application program to query the data base in order to establish certain antecedent conditions 
(including temporal conditions) and then, on the basis of these conditions, to assert consequent predictions. 
These predictions remain valid just in case the antecedent conditions continue to hold. 

Perhaps one of the most important and most overlooked characteristics of a temporal reasoning system 
is the ability to handle incomplete information of the sort one invariably encounters in realistic applications. 
For example, we seldom know the exact duration or time of occurrence of most events. Moreover, for those 
durations and offsets we do know, they are seldom with respect to a global frame of reference such as a clock 
or calendar. In the TDBMS, every point is a frame of reference, and it is possible to constrain the distance 
between any two points simply by specifying bounds on the distance in time separating the two points. 
By allowing bounds to be both numeric and symbolic, the same framework supports both qualitative (i.e. 
ordering) and quantitative (distance) relationships. 

Another important aspect of reasoning with incomplete information has to do with the default character 
of temporal inference. In general, it is difficult to predict in advance how long a fact made true will persist. 
It would be convenient to leave it up to the system to decide how long facts persist based upon the simple 
default rule [9] that a fact made true continues to be so until something serves to make it false. This 
is exactly what the TDBMS does. The term persistence is used to refer to an interval corresponding to a 
particular (type of) fact becoming true and remaining so for some length of time. A fact is determined to 
be true throughout an interval / just in case there is a persistence that begins before the beginning of / 
and it can’t be shown that the persistence ends before the end of /. 

The TDBMS permits the specification of partial orders, but it imposes orderings in situations leading to 
potential incoherency. If the TDBMS encounters two persistence intervals of contradictory types that are not 
ordered with respect to one another, it prompts the calling program to resolve the possible contradiction by 
either imposing an order or explicitly introducing a disjunction. By introducing a disjunction, the calling 
program effectively splits the data base, producing two time lines. The answers returned by queries to the 
TDBMS indicate the disjuncts that must be true for a query to succeed (i.e., the particular time line that 
satisfies the query). There are also language constructs that allow a calling program to eliminate disjuncts 
(and hence time lines) that have been ruled out. Unfortunately, as we will see in Section 5, eliminating 
explicit contradictions is not sufficient to ensure consistency in a system capable of making conditional 
predictions from a set of partially ordered events [1]. Before we continue our discussion it will help to 
introduce some notation. 
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Relations* Let II be the set of point! corresponding to the begin and end of events in a particular 
temporal data base. We define a function D1ST to denote the best known bounds on the distance in 
time separating two points. Given € IT such that DIST(*i, 73) = (low, high), we have: 


• *i < *2 o low > € l 

• f l 5 jrj O (low } high) = (0,0) 

• *i * 2 o (*j -< *j) v (*, s »,) 

• ~<M *7 O high > € 

• *i *2 <=> hi^/i > 0 


- st precedes w% 

- st is coincident with s 3 
- si precedes or is coincident with s 3 

- si possibly precedes s 2 
- si possibly precedes or is coincident with sj 


Tokens* We denote a set of time tokens T = {to, t \ t . * .*«} for referring to intervals of time during which 
certain events occur or certain facts are known to become true and remain so for some period of 
time. The latter correspond to what we have been calling persistences. For a given token t: 

• BEGIN(t), END(t) 6 IT. 

• STATUS(t) € {iNjOUT}, determined by whether the token is warranted (IN) or not (OUT) by the 
current premises and causal theory. 

• TYPE(t) = P where P is an atomic predicate calculus formula with no variables 

• DURATION (t) = DIST(BEGIN(<), END(t)) 

Types. As defined above, the type of an individual token is an atomic formula with no variables (e.g., (on 
blockl4 table42)). In general, any atomic formula, including those containing variables, can be 
used to specify a type. In describing the user interface, universally quantified variables are notated 
?variable-name, the scope of the variable being the entire formula in which it is contained (e.g., 
(on ?x ?y)). In describing the behavior of the inference system, we will use variables of the form 
tp to quantify over tokens of type P (e.g., Vtp € T TYPE (tp) = P). 


3 Reasoning about Causality 

In the TDBMS, a causal theory is simply a collection of rules, called projection rules , that are used to 
specify the behavior of processes. In the following rule, P\ . . . P n , Q \ . . .Q m , E, and R designate types, 
and delay and duration designate constraints (e.g., (e,oo)). In: 

(project (and P t • ■ • Pn 

(M (not (and Qi . . . Q m )})) (U 

E delay R duration) 

P x . . .P n and Qi . . .Q m are referred to as antecedent conditions, E is the type of the triggering event , 
and R refers to the type of the consequent prediction . The above projection rule states that, if an event 

*The symbol t is meant to denote an infinitesimal: a number greater than 0 and smaller than any positive number. 
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El: (project (aad P % . . . P m 

(M (not (aad Q x ... Q m )))) 

E R) 


R2: (project (aad P\ . . . P n ) 
E R) 

R3: (disable (aad Q l . . .Q m ) 
(ab R2)) 

R4: (disable (aad Ri . . . R*) 
(ab R3)) 


Figure 1: Hierarchically arranged projection and disabling rules 

of type E occurs corresponding to the token tg end P\...P n are believed to be true at the outset 3 of tg 
and it is consistent to believe that the conjunction of Q \ . . .Q m is not true at the outset of then, after 
an interval of time following the end of tg determined by delay, R will become true and remain so for a 
period of time constrained by duration (if delay and duration are not specified, they default to (0,0) and 
(c,oo), respectively). In the following, we will be considering a restricted form of causal theory, called a 
type 1 theory, such that the delay always specifies a positive offset (causes always precede their effects). 

We also allow the user to specify rules that serve to disable other rules [11]. Figure 1 shows a standard 
projection rule Rl and a pair of projection and disabling rules R2 and R3 that replace Rl. The rule R3 is 
further conditioned by the rule R4. Assuming just the rules R2, R3, and R4, any application of R2 with 
respect to a particular token t of type E is said to be abnormal with regard to t just in case Q t . . .Q m 
hold at the outset of t and it is consistent to believe that R3 is not abnormal with regard to t. The 
nonmonotonic behavior of type 1 causal theories is specified entirely in terms of disabling rules and the 
default rule of persistence (see Section 2). In addition to their usefulness for handling various forms of 
incomplete information, disabling rules make it possible to reason about the consequences of simultaneous 
actions. The reader interested in a more detailed treatment of causal theories may refer to one of [4], [5], 
or [11]. In parts of this paper, we will ignore disabling rules and speak of causal theories consisting solely 
of simple projection rules of the form (project (and P\ ...P n ) E R). 

One of the most problematic aspects of designing a temporal inference system involves defining precisely 
what it means for a fact to be true at a point or throughout an interval (i.e., the conditions under which a 
query of the form TT(P, arg, * 2 ) will succeed). As a first approximation, we offer the following definition, 
which we will refer to as weak true throughout: 

3 An alternative formulation described in [3] states that the antecedent conditions of a projection rule must be true throughout 
fhe trigger event rather than true just at the outset. Both formulations are supported in the TDDMS, though we will only be 
discussing the true-at-the-outset formulation in this paper. 
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(STATUS(tfl) = IN) A 

(DIST(END(tff ), BEGIN(tit)) Q delay) A 

(DIST(BEGIN(tx)i END(tjjj) Q duration ) 


Figure 2: Weak projection 


Vir 6 II 

3tj. € T (2) 

(STATUS (t r ) = IN) A 
(BEGIN((j>) i »Tj) A 
(ir a BND(tr)) 

O TT(P,* i,r 3 ) 

In order to specify the behavior of a temporal inference system such as the TDBMS, we also need to 

define the criterion for inferring consequent affects from antecedent conditions via causal rules. Our first 

such criterion will be referred to as weak projection (Figure 2) and is defined with respect to the general 
form of a projection rule (1). Weak true throughout and weak projection correspond to the assumption 
that "what you don’t know won’t hurt you.” Only those events that can be shown to be ordered with 
respect to a particular point will have any effect at that point. As we will see in Section 5, there are some 
problems with this formulation. 


4 Transactions on the Data Base: the User Interface 

Every inference system provides some means for the user to specify rules (referred to collectively as a 
causal theory) for inferring additional consequences of the data (referred to here as a set of basic facts). 
An application program interacts with an inference system by adding and removing items from the set of 
basic facts, which in the TDBMS corresponds to a set of tokens and a set of constraints. The state of a 
temporal data base is completely defined by a temporal constraint graph (TCC), consisting of the begin 
and end points of tokens and constraints between them, and a causal dependency graph (CDC), consisting 
of dependency structures corresponding to the application of causal rules in deriving new tokens. Each 
transaction performed on the temporal data base results in changes to these two data structures. The 
TDBMS is responsible for maintaining the temporal data base so that it captures exactly those consequences 
that follow from the causal theory and the current set of basic facts. 
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Generally, the causal theory remains fixed for a particular application, and interaction with the TDBMS 
consists of a series of transactions and queries. A transaction consists of either adding or removing 
some token or constraint from the set of basic facte. A query consists of e predicate calculus formula 
corresponding to a question of the form "Could some fact P be true over a particular interval IT* An 
affirmative answer returned by the TDBMS in response to such a query will include a set of assumptions 
necessary for concluding that the fact is indeed true. Any assertions made on the basis of the answer 
to such a query are made to depend upon these assumptions. There is also a mechanism that enables 
the TDBMS to detect and, with the assistance of the calling program, resolve inconsistencies in the set of 
constraints. 

5 Completeness and Consistency 

The primary source of ambiguity in the TDBMS arises from the fact that the set of constraints seldom 
determines a total ordering of the tokens in T. Given that most inferences depend only upon what is 
true during intervals defined by points corresponding to the begin and end of tokens in T, all that we are 
really interested in is what facts are true in what intervals in the different total orderings of time points 
consistent with the initial set of constraints. For each total ordering we can identify a unique set of tokens 
that intuitively should be IN given a particular causal theory. 

As far as we are concerned, an inference procedure is fully specified by a criterion for inferring consequent 
effects from antecedent cause" via causal rules (e.g. weak projection), a method for actually applying that 
criterion (an update algorithm), and a criterion for determining if a fact is true throughout some interval 
(e.g., weak true throughout). We will say that a particular inference procedure is complete for a class 
of causal theories, if for any set of basic facts and causal theory in the class, the statements of the form 
TT(P,x i,**) warranted by the inference procedure are exactly those that are true in all total orders 
consistent with the constraints in the TCC. Similarly, we will sav that an inference procedure is sound 
for a class of causal theories, if for any set of basic facts and causal theory in the class, each statement 
TT(P, *i, * 2 ) warranted by the inference procedure is true for any total order consistent with the set of 
constraints. Given the preceding definitions, it is easy to show that the TDBMS is complete and sound for 
type 1 causal theories, assuming that the tokens in T are totally ordered [6). 

In situations where the set of basic facts does not determine a total order, it’s easy to show that the 
TDBMS can end up in a state with IN tokens that allow one to conclude statements of the form TT(P, *i, * 2 ) 
that are not true in any totally ordered extension. One thing we might do to improve the chances of the 
TDBMS warranting only valid statements of the form TT(P, jtj, * 2 ) is strengthen the criterion for belief in 
a given token. We can determine a class of tokens that are said to be strongly protected , using the axioms 
in Figure 3. In these axioms and the rest of the paper, we use Tq to denote the tokens in the set of basic 
facts. If the set of constraints determines a total ordering, then the set of strongly protected tokens is 
identical to the set of tokens that are IN, but generally the former is a subset of the latter. Using this 
notion of strongly protected, we can define a stronger true throughout criterion that we will refer to as 
strong true throughout: 
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Figure 3: Strong protection defined for simple projection rules 
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Figure 4: Improbably weak projection defined for simple 
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As it turns out, weak projection and strong true throughout still do not constitute a sound inference 
procedure. We can show that, even when we restrict ourselves to strongly protected tokens, most interesting 
decision problems are intractable. In fact, we can prove that the problem of determining if TT[P, *i, *j) is 
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true for a type 1 causal theory, with or without disabling rules, is NP-complete [6]. Although completeness is 
computationally infeasible, it is possible to devise an inference procedure that is both sound and capable of 
performing useful prediction. First, we provide a criterion for generating consequent predictions that takes 
into account every consequence that might be true in any total order, called improbably weak projection 
(Figure 4). Second, we provide a criterion for true throughout that succeeds only if the corresponding 
formula will be true in all total Jers consistent with the current set of constraints. We define improbably 
strong true throughout: 

Vvitfa € II 

3 tp e T (4) 

STRONGLY-PROTECTEDfo ) A 
(BEGIN(t/») < w x ) A 
(VUr € T 

($TATUS(U/») = OUT) V 
(BEGIN(U/») -< BEClN(t/»)) V 
(v 3 -< BEGlN(Up)) ) 

TT(P % *i, *3) 

There is a simple decision procedure for generating all consequences and computing the set of strongly 
protected tokens. Let Ti = Tq, and initially assume that no tokens are strongly protected. Let i = 1. To 
compute the consequences of Ti, set T*+i = T,*, compute the consequent tokens of each token in T* using 
the criterion of improbably weak projection, and add any new tokens to T,+i- Continue to compute new 
consequent tokens in this manner incrementing i as needed until Ti = T,+i. Set T = Ti. At this point, 
perform a sweep forward in time (relative to the current partial order) determining for each token in T 
whether or not it is strongly protected and the status, IN or OUT, of each its consequents. In [6], we prove 
that this decision procedure is sound for a partially ordered set of tokens, and sound and complete for a 
totally ordered set. 

In the same paper, we give two incremental update algorithms with polynomial-time worst case be- 
havior, one for weak projection and weak true throughout, and one for improbably weak projection and 
improbably strong true throughout. The latter algorithm does not model the decision procedure given 
above — there is a more complicated procedure with the same behavior that is more efficient by a constant 
factor. We prove that these algorithms support exactly the conclusions licensed by their respective infer- 
ence methods. Proving that the algorithms terminate is in one sense impossible. Using the TDBMS and a 
type 1 causal theory with arithmetic functions (e.g., (project (contents Tregister ?n) (increment 
?regi8ter) (contents ?register (♦ 1 ?n) ))), we can easily simulate a Turing machine. There are a 
number of methods for either restricting what the user can encode in a causal theory or limiting the scope 
of the inferences computed by the TDBMS in such a way that we can guarantee that the update terminates. 
By limiting the scope of the inferences computed by the TDBMS, we potentially sacrifice completeness, but 
we have shown that to ensure completeness may require an exponential amount for time /or other reasons. 


6 Conclusions 

This paper is concerned with computational approaches to reasoning about time and causality, particularly 
in domains invc Iving partial orders and incomplete information. We have described a class of causal theories 
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involving a carefully restricted use of nonmonotonic inference, capable of representing conditional effects 
and the effects of simultaneous actions. We showed in [6] that the decision problem for nontrivial inference 
systems involving conditional effects and partially ordered events is NP^eompUte. As an alternative to a 
complete but potentially exponential-time inference procedure, we have described a decision procedure, for 
which there is an incremental polynomial-time algorithm, that generates a useful subset of those inferences 
that will be true in all total orders consistent with the initially specified partial order. The decision 
procedure is provably sound and the resulting conclusions are guaranteed consistent if the underlying 
causal theory is consistent. If the events turn out to be totally ordered, the procedure is complete as well 
as tound. 


References 

1. Chapman, David, Planning for Conjunctive Goal*, Technical Report AI-TR-802, MIT AI Laboratory, 1985. 

2. Charniak, Eugene and McDermott, Drew V., Introduction to Artificial Intelligence (Addison- Wesley Publishing 
Co., 1985). 

3. Dean, Thomas, Temporal Imagery: An Approach to Reasoning a&out Time for Planning and Problem Solving, 
Technical Report 433, Yale University Computer Science Department, 1985. 

4. Dean, Thomas, and McDermott, Drew V., Temporal Data Base Management, to appear in Artificial Intelli- 
gence ■. 

5. Dean, Thomas, An Approach to Reasoning About the Effects of Actions for Automated Planning Systems, to 
appear in the 1987 volume of the Annals of Operations Research entitled "Approaches to Intelligent Decision 
Support*. 

6. Dean, Thomas, and Boddy, Mark, Incremental Causal Reasoning , Technical Report CS-87-1, Brown University 
Department of Computer Science, 1987. 

7. Doyle, Jon, A truth maintenance system, Artificial Intelligence 12 (1979) 231-272. 

8. Fikes, Richard and Nilsson, Nils J., STRIPS: A new approach to the application of theorem proving to problem 
solving, Artificial Intelligence 2 (1971) 189-208. 

9. Reiter, Raymond, A Logic for Default Reasoning, Artificial Intelligence IS (1980). 

10. Sacerdoti, Earl, A Structure for Plans and Behavior (American Elsevier Publishing Company, Inc., 1977). 

11. Shoham, Yoav, Chronological Ignorance: Time, Knowledge, Nonmono tonicity and Causation , in Georgeff, 
Michael P. and Lansky, Amy L. (Eds.), The 1986 Workshop on Reasoning about Actions and Plans , (Morgan- 
Kaufm^n 1987). 


58 



5 >• / 2 

~-'V — tp Zj 


% 



' /' 


* ;<fv- 


/* w/i 


The Generic Task Toolset: High Level Languages for the 
Construction of Planning and Problem Solving Systems 



B. Chandrasekaran, J. Jotephaon, and D. Herman 
, _ c^y.Ohio State University 

■ fid** 1 


Columbus, OH 43210 


Abstract 


r> 


/( 


;}■ 



5^ 




■U'f 


begin by/ criticizing the current generation of lan- 
guages for the /construction of knowledge- based systems as 
being at too Uiw a level of abstraction/ the 
need for higher level languages for building problem solv- 
ing systems/ We*- introduce mrrffi lotion of generic infor- 
mation processing tasks in know [edge- based problem solv- 
ing, and dm c r i b e^a toolset which can be used to build rx- 
systems in way that enhances intelligibility and 
productivity in knowledge acquisition and system construe 
lion. W j. il lu » t f i4 e ihe power of these ideas s£y paymi 
inattention to Vigh "level- language called DSPL. 
aoiiUdMBsibe^how it was used in the construction of a sys- 
tem called MPA, which assists with planning in the 
domain of offensive counter air missions. 
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that rules of medicine are not per se matters of A! 
research. That is, the content of the knowledge base itself 
is made up of domain particulars. At the same time the 
language in which the rule system is written, e.g.. Lisp, is 
typically thought of as an implementation-level detail, of 
no particular interest as A I. 
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Scope and Intent 
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is intended to be an introduction to the 
generic taslls approach to analyzing knowledge systems, 
descriptions l of some of the particular generic tasks that 
have been Identified, and a description of the software 
tools that have been build as a result. The approach is 
illustrated bu discussing a particular mission-planning sys- 
tem, MPA, which was built using the DSPL language (or 
tool). The intent of the paper is primarily tutorial, much 
of the material is summary or repetition of material al- 
ready presented elsewhere. 1, i 

' ///pi/r 

Level of Absti rtlon Problem In Characterizing 
Knowledge- B a d Systems 


Much of A I 
Grail” of a level 
havior qua intelligi 
presumably, are 


domain, and below 
the intelligent level 
rule-based system su 



The AI interest of a given expert system is often a mat* 
ter of the level at which it is viewed. Clearly, taking 
Mycin as an example, all the following points of view are 
correct: (i) it suggests therapy for certain kinds of infec- 
tious diseases, (ii) it is an embodiment of a diagnostic and 
therapeutic strategy, and (iii) it is a decision-maker which 
uses backward chaining to navigate a knowledge base of 
ules, connecting pieces of evidence to conclusions, in order 
lo arrive at a reliable conclusion from a given set of data. 

] Joint of view (i) is of limited interest to Al. and point of 
iew (iii) has been the level at which the system as an AI 
( A system has been generally presented; this is the level at 
vhich the claim for generality of the underlying approach 
s made. All the excitement surrounding the “knowledge 
^base/inference engine” paradigm, and the idea that 
knowledge acquisition and explanation can be keyed to 
phenomena at the rule level, emphasizes that the level cor- 
responding (iii) has been the level of abstraction at which 
AI interest has been expressed, and claims of progress 
have been made. 


be said to be a search for the “Holy 
abstraction at which intelligence be- 
t behavior emerges. Above this level, 
icular pieces of knowledge of the task 


level are specific details of how 
implemented. For example, in a 
as Mycin. everyone would agree 
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What of the level corresponding to (ii)? We have for 
several years at Ohio Slate worked on the hypothesis that 
this is indeed an important level of abstraction for AI; 
and that knowledge representation, control regimes for 
problems solving, knowledge acquisition, and explanation 
all can be significantly advanced by looking at phenomena 
at this level. B. Chandrasekaran has put forth this view 
in 1, 4 . In this paper we give a critical analysis of an 
important part of A I, viz., know I edge- based reasoning, and 
propose that a point of view based on a particular level of 
abstraction corresponding to generic information 
processing tasks has a number of advantages both for 
clarity ol analysis and for system design. Tins is not 
pure theoretical speculation; researchers in our laboratory 
have built many systems and tools based on this 
framework. We wish to point to this level of abstraction 
as a productive level for concentration, and to indicate the 
conceptual advantages of it. Knowledge acquisition, sys- 
tem design, control of problem solving and generation of 
explanation ail are facilitated at the same time, indicating 
that there is a naturalness to looking at phenomena at 
this level. 
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Intuitively one thinks that there must be some com- 
monality of reasoning processes that characterize 
“diagnosis” as a generic activity, even across domains as 
different as medicine and mechanical systems. There 
should be control strategies and ways of using knowledge 
that are common to diagnostic reasoning as such, or at 
least typical of diagnostic reasoning. Similarly there 
should be common types of knowledge structures and con- 
trol strategies for, say, design as a kind of reasoning ac- 
tivity. Further, we expect that the structures and control 
regimes for diagnostic reasoning will be generally different 
from those for design reasoning. However, when one looks 
at the formalisms (or equivalently the languages or shells), 
that are commonly used in expert system design, the 
knowledge representation and control regimes do not typi- 
cally capture these distinctions. For example, in diagnos- 
tic reasoning, one might generically wish to speak in term* 
of malfunction hierarchies, rule-out strategies, setting up a 
differential, etc., while for design, the generic terms might 
be device/component hierarchies, design plans, ordering of 
design subtasks, etc. Ideally one would like to represent 
diagnostic knowledge in a domain by using the vocabulary 
that is appropriate for the task. But typically the lan- 
guages in which the expert systems have been imple- 
mented have sought uniformity across tasks, and thus 
have lost clarity of representation at the task level. The 
coniputatjpnaj universality of representation languages such 
as Emycin or OPS5 - i.e., the fact that any computer 
program can in principle He written in these languages ~ 
often confuses the issue, since after the system is finally 
built it is often unclear which portions of the system 
represent domain expertise, and which are programming 
devices. In addition, the control regimes that these lan- 
guages come with (such as forward or backward chaining) 
do not explicitly indicate the real control structure of the 
system at the task level. For example, the fact that Hi 
5 performs a linear sequence of subtasks - an atypically 
simple strategy for design problem solving — is not ex- 
plicitly encoded; the system designer so to speak 
“encrypted” this control in the pattern-matching control of 
OPS5. 

These comments need not be restricted to the rule-based 
framework. One could represent knowledge as sentences in 
a logical calculus and use logical inference mechanisms to 
solve problems; or one could represent it in a frame 
hierarchy with procedural attachments in the slots. (It is 
a relatively straightforward thing, e.g, to rewrite MYCIN 
h in this manner, see'.) In the former, the control issues 
would deal with choice of predicates and clauses, and in 
the latter, they will be at the level of which links to pur- 
sue for inheritance, etc. None of these have any natural 
connection with the control issues specific to the task. 

The other side of the coin, so to speak, regarding con- 
trol is the following; because of the relatively low level of 
abstraction relative to the information processing task, 
there are control issues that are artifacts of the represen- 
tation, but often in our opinion misinterpreted as issues at 
the “knowledge-level.” K.g., rule-based approaches often 
concern themselves with conflict resolution strategies. If 
the knowledge were viewed at the level of abstraction ap- 
propriate to the task, often there will be organizational 
elements which would result in only a small set of highly 
relevant pieces of knowledge or rules to being brought up 
for consideration, without any conflict resolution strategies 
being needed. Of course, these organizational constructs 


could be “programmed” in the rule language, but because 
of the status assigned to the rules and their control as 
knowledge-level phenomena (as opposed to the implemen- 
tation level phenomena, which they often are), knowledge 
acquisition is often directed towards (typically syntactic) 
strategies for conflict resolution, whereas the really opera- 
tional expert knowledge is at the organizational level. 

This level problem with control structures is mirrored in 
the relative poverty of know ledge- level primitives for 
representation. For example the epistemology of rule sys- 
tems is exhausted by data patterns (antecedents or 
subgoals) and partial decisions (consequents or goals), that 
of logic is similarly exhausted by predicates, functions, in- 
ference rules, and related primitives. If one wishes to talk 
about types of goals or predicates In such a way that con- 
trol behavior can be indexed over this topology, such a 
behavior can often be programmed into these systems, but 
no explicit rendering of them is possible. E.g., Clanrey 
* found in his work using Mycin to teach students that 
for explanation he needed to attach to each rule in the 
Mycin knowledge base encodings of types of goals so that 
explanation of its behavior can be couched in terms of 
this encoding, rather than only in term of “Because 
was a subgoal of - This is not to argue that rule 

representations and backward or forward chaining control* 
are not “natural” for some situations. If all that a 
problem solver has for knowledge in a domain is in the 
form of a large collection of unorganized associative pat- 
terns, then data-directed or goal-directed associations may 
be the best that the agent can do. But that is precisely 
the occasion for weak methods such as hypothesize and 
match (of which the above associations are variants), and. 
typically, successful solutions cannot be expected in com- 
plex problems without combinatorial searches. Generally, 
however, expertise can be expected to consists of much 
more organized collections of knowledge, with control be- 
havior indexed by the kinds of organizations and forms of 
knowledge in them. 

Thus, there is a need for understanding the generic in- 
formation processing tasks that underlie knowledge-based 
reasoning. Knowledge ought to be directly encoded at the 
appropriate level by using primitives that naturally 

describe the domain knowledge for a given generic task. 
Problem solving behavior for the task ought to be con- 
trolled by regimes that are appropriate for the task. If 
done correctly, this would simultaneously facilitate 

knowledge representation, problem solving, and explana- 
tion. 

At this point it will be useful to make further distinc- 
tions. Typically many tasks that we intuitively think of 
as generic tasks are really complex generic tasks. I. e.. 
they are further decomposable into components which are 
more elementary in the sense that each of them has a 
more homogeneous control regime and knowledge struc- 
tures. For example, what one thinks of as the diagnostic 
task, while it may be generic in the sense that the task 
may be quite similar across domains, it is not a unitary 
task structure. Diagnosis may involve classiflcatory 
reasoning at a certain point, reasoning from one datum to 
another datum at another point, ami abduclive assembly 
of multiple diagnostic hypotheses at another point. 
Hierarchical classification has a different form of knowledge 
and control behavior from those for data-to-data reasoning, 
which in turn is dissimilar in these dimensions from as- 
sembling hypotheses. Our research focuses on tasks at 
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both these levels, but the latter are viewed as somewhat 
“atomic” tasks into which more complex, but still generic, 
tasks such as diagnosis and design can often be decom- 
posed. 

To summarize the view presented so far: There is a need 
for understanding the generic information processing tasks 
that underlie know ledge- based reasoning. Knowledge ought 
to be directly encoded at the appropriate level by using 
primitives that naturally describe the domain knowledge 
for a given generic task. Problem solving behavior for the 
task ought to be controlled by regimes that are ap- 
propriate for the task. If done correctly, this would 
simultaneously facilitate knowledge representation, problem 
solving, and explanation. 

Over the years, we have identified, and built systems 
using, six such generic tasks. Our work on MDX 0, ,0 . 
e.g., identified hierarchical classification, 

knowledge-directed information passing. and 
hypothesis matching as three generic tasks, and showed 
how certain classes of diagnostic problems can lx* imple- 
mented as an integration of these generic tasks. Since 
then we have identified several others: object synthesis 
by plan selection and refinement 11 , state 
abstraction 4 , and abductive assembly of 
hypotheses 1 *. There is no claim that these six are ex- 
haustive; in fact, our ongoing research objective is to iden- 
tify other useful generic tasks and understand their 
knowledge representations and strategies for control of 
problem solving. 

Some Generic Tasks 

Characterization of Generic Tasks 

Each generic task is characterized by: a task 

specification in the form of generic types of input and 
output information; specific forms of knowledge needed 
for the task, and specific organization of knowledge 
particular to the task; a family of control regimes that 
are appropriate for the task. 

A task-specific control regime comes with certain charac- 
teristic types of strategic goals. These goal types will 
play a role in providing explanations of its problem solv- 
ing behavior. 

When a complex task is decomposed into a set of 
generic tasks, it will in general be necessary to provide for 
communication between the different structures specializing 
in these different types of problem solving. Also there is 
not necessarily a unique decomposition. Depending upon 
the availability of particular pieces of knowledge, different 
architectures of generic tasks will typically be possible for 
a given complex task. 

We will now give brief characterizations of the generic 
tasks that we have identified. 

Taxonomic Classification 

Task specification: Classify a (possibly complex) descrip- 

tion of a situation as an element, as specific as possible, 
in a classification hierarchy. E.g. classify a medical case 
description as an element of a disease hierarchy. 


Forms of knowledge: one main form is < partial situa- 

tion description > — > evidence /belief about confirmation 
or disconfinnation of classificatory hypotheses. E.g., in 
medicine, a piece of classificatory knowledge may be: cer- 
tain pattern in X-ray & bilirubin in blood — > high 
evidence for cholestasis. 

Organization of knowledge: knowledge of the form above 
distributed among concepts in a classificatory concept 
hierarchy. Each conceptual “specialist” ideally contains 

knowledge that helps it determine whether it (the concept 
it stands for) can be established or rejected. 

Control Regime: Problem solving is top down, each con- 
cept when called upon tries to establish itself. If it suc- 
ceeds, it lists the reasons for its success, and calls its suc- 
cessors, which repeat the process. If a specialist fails in 
its attempt to establish itself, it rejects itself, and all its 
successors are also automatically rejected. This control 
strategy can be called Establish- Refine, and results in a 
spec i He classification of the case. (The account is a 

simplified one. The reader is referred to 10 for details and 
elaborations.) 

Goal types: E.g., Establish <concept>, Refine 

(subclassify) <concept> 

Example Use: Medical diagnosis can often be viewed as 

a classification problem. In planning, it is often useful to 
classify a situation as of a certain type, which then might 
suggest an appropriate plan. 

Object Synthesis by Plan Selection and Refinemen t 

Task Specification: Design an object satisfying specifica- 

tions (object in an abstract sense: they can be plans, 
programs, etc.). 

Forms of knowledge: Object structure is known at some 

level of abstraction, and pre-compited plans are available 
which can make choices of components, and have lists of 
concepts to call upon for refining the design at that level 
of abstraction. 

Organization of Knowledge: Concepts corresponding to 
components organized in a hierarchy mirroring the object 
structure. Each concept has plans which can be used to 
make commitments for various '‘dimensions” of the com- 
ponent. 

Control Regime: Top down in general. The following is 
done recursively until a complete design is worked out: A 
specialist corresponding to a component of the object is 
called, the specialist chooses a plan based on the specifica- 
tions and problem state, instantiates and executes the plan 
which suggests further specialists to call to set details of 
the subcomponents. Plan failures are passed up unli! ap- 
propriate changes are made by higher level specialists. 

Goal Types: E.g., Choose plan, execute plan element •. 
refine <plan>, redesign (modify) partial design * to 
respond to failure of '.subplan. *, select alternative plan, 
etc. 

Example: Expert design tasks, routine synthesis of plans 

of action. 

We will characterize the other generic tasks more suc- 
cinctly. The reader is referred to 1 for more details. 
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Knowledge-Directed Information P assing 

Task: It is desired to obtain attributes of some datum, 
by deriving from some conceptually related datum. Some 
forms of knowledge are: <attribute> of <datum> is in- 

herited from < attribute > ijnM parent of <datum>, 
<attribute> of <datum> is related as <relation> to 
<attribute> of <concept>. Organization: concepts are or- 
ganized as a frame hierarchy * with IS- A and PART-0 F 
links. Each frame is a specialist in knowledge-directed 
data inference for the concept. This is basically a hierar- 
chical information-passing control regime. 

Example uses: knowledge-based data retrieval tasks in 
wide variety of situations, as an intelligent data base in 
support of problem solvers of other types. 

Abductivc Assembly of Explana tory Hypotheses 

Task Specification: Given a situation (described by a set 
of data items) to be explained by the best explanatory ac- 
count, an d 0 iven a number of h) t . theses, each associated 
with a degree of belief, and each of which offers to ex- 
plain a portion of the data (possibly overlapping with data 
to be accounted for by other hypotheses), construct the 
best composite explanatory hypothesis. Some forms of 
knowledge are: causal or other relations (e.g. special case 
of, incompatibility, suggestiveness) between the hypotheses, 
relative significance of data items, and ways to group data 
items to be explained. Organization: one main, or a 
hierarchical community of active abducers, each specializ- 
ing in explaining a certain portion of the data by compos- 
ing and criticizing hypotheses. Control Regime: (See 

13 for a fuller discussion.) A specialized means-ends 
regime is in control, driven by the goals of explaining all 
the significant findings, with an economical hypothesis, 
which is consistent, and has been criticized for certain 
strengths and weaknesses. Some goal types are: account- 
for <datum,': check-superfluousness-of v hypothesis >. 

choose the most significant unexplained finding. The In- 
ternist system 11 and the Dendral system 15 perform abduc- 
tive assembly as part of their problem solving. 

St ate Abst raction 

Task Specification: Given a change in some state of a 
system, provide an account of the changes that can be ex- 
pected in the functioning of the system. (Useful for 

reasoning about consequences of changes on complex 
systems.) One knowledge form is * change in state of 

subsystem> — "> - change in functionality of subsystem - 
change in state of the immediately larger system The 

knowledge is organized into conceptual specialists cor- 
responding to systems and subs) stems connected in a way 
mirroring the way the systems and subsystems are put 
together. The control is basically bottom up. following 

the architecture of the system/ subsystem relationship. 
The changes in stales are followed through, interpreted as 
changes in functionalities of subsystems, until the changes 
in the functionalities at the level of abstraction desired are 
obtained. This form of reasoning is useful for answering 
questions like: **What system functionalities will be com- 
promised if this valve fails closed?’’. 

Hypothesis Matchi ng 

Given a concept and a set of data that describe the 
problem state, decide if the concept matches the situation. 
The idea here is to encode the routine knowledge for 


verification and refutation that the concept applies to the 
situation. One way this can be done is by using a hierar- 
chical representation of evidence abstractions, where the 
top node determines the overall degree of matching of the 
hypothesis to the data, and lower-level nodes represent 
components or features of evidence for the evidence 
abstraction at higher levels. Form of knowledge are such 
as to enable evaluation of strength for each evidence 
abstraction, and to support mapping degrees of belief ;r 
each of these evidence abstractions, to degree of belief in 
the higher abstractions. Strength for an evidence abstrac- 
tion can be determined Each evidence abstraction can be 
determined by matching against prototypical patterns 
which have evidential significance. Samuel’s sigruUutt 
tables can be thought of as performing this task. 

How Existing Expert Systems can be Analysed In 
This Framework 

Separating the implementation language and the intrinsic 
nature of the tasks has been argued as being salutary for 
a number of reasons. Let us look at some of the better 
known expert systems from the perspective of the 
framework developed so far in this paper. 

MYCJN’s task is to (i) classify a number of observations 
describing a patient's infection as due to one or another 
organism, and (ii) once that is done, to instantiate a plait 
with parameters appropriate for the particular patten* 
situation. We have shown in lt * how the diagnostic portion 
of MYCIN can be recast as a classification problem solver, 
with a more direct encoding of domain knowledge and a 
control structure that is directly appropriate to this form 
of problem solving. 

Prospector 17 classifies a geological description as one of a 
p re-enumerated set of formations. Internist 14 generates 
candidate hypotheses by a form of enumeration 
(plausibility scoring and keeping only the top few) and 
uses a form of abductive assembly to build a composite 
hypothesis that accounts for all of the data. Assembly 
and hypothesis enumeration alternate. Dendral 15 generates 
candidate hypotheses by a form of hypothests matching and 
uses a form of abductive assembly which puts together the 
best molecular hypothesis from the fragments produced by 
the matching process. 

Note that in these analyses we have not mentioned rules 
(Mycin), networks (Prospector), graphs (Dendral), etc., 
which are the means of encoding and carrying out the 
tasks. This separation is an essential aspect of what we 
mean by the “right” level of abstraction in analysis. 

Encoding Knowledge at the Level of the Task: 

The Generic Task Toolset 

For each generic task, the form and organization of the 
knowledge directly suggests the appropriate representation 
in terms of which domain knowledge for that task can be 
encoded. Since there is a control regime associated with 
each task, the problem solver can be implicit in ;he 
representation language. That is, a shell for each generic 
task can be constructed such that, as soon as knowledge 
is represented in the shell, a problem solver which uses 
the control regime on the knowledge can be created by 
the interpreter. This is similar tn what representation 
systems such as KMYCIN do. hut note that we are 


62 


deliberately trading generality at a lower level to gain 
specificity, clarity, and richness of ontology and control at 
a higher level. 

We have designed and implemented representation lan- 
guages for versions of each of the six generic tasks we 
described. Here is a list of the generic task tools, each 
with a brief description of the task for which it is 
designed. 

• HYPER for matching concept to situation to 
determine confidence or appropriateness. 

• CSRL for taxonomic classification, typically a 
major component of diagnostic reasoning . 18 

• DSPL for object synthesis by plan selection 
and refinement, captures knowledge for certain 
routine design and planning tasks . 10 

• IDABLE for knowledge-directed information 
passing for intelligent data retrieval. 

• PEIRCE for assembly and criticism of com- 
posite explanatory hypotheses, a form of 
abduction or best-explanation reasoning . 20 

• WWHI (What Would Happen IQ for predic- 
tion by abstracting state changes. 

We have described how this approach directly helps in 
providing intelligible explanations of problem solving in ex- 
pert systems . 21 The approach has a number of other im- 
plications. E.g.. uncertainty Handling in problem solving is 
usefully viewed as consisting of different types for each 
kind of problem solving, rather than as a uniform general 
method .- 2 

In principle the tools can be used together to build com- 
posite problem solvers that integrate the different types of 
reasoning associated with the generic tasks. Systems have 
been built integrating more than one type of reasoning 
(the Red 21 system for example relies on four of the types) 
but these systems predate the availability of the tools. At 
present the actual toolset consists of separately imple- 
mented tools in a variety of languages: each tool having 
an incarnation in Interiisp. LAIR has under development 
an integrated version of the toolset in Common Lisp. 

The computational architecture of a problem-solving sys- 
tem (or system component) built with any of the tools is 
based on functionally distinct, highly modular elements, 
tightly organized. A generic task problem solver is a 
community of agents, where each agent is of a specific 
type, each has its own embedded knowledge. The agents 
are organized so that they have specific lines of com- 
munication with each other: and. depending on the generic 
task, they pass control around in a well-defined way in or- 
der to cooperate and solve the problem. A II YPER-built 
system is made up of knowledge groups, hierarchically or- 
ganized: a (.SRL-built system consists of a hierarchical 
community of classification specialist, each specializing in a 
single classificatory concept. This sort of system architec- 
ture. besides making implementation in an object-oriented 
programming framework fairly easy, makes for systems 
that are distributable and have predictable concurrencies. 
The high degree of modularity— modules having clear func- 
tions. and clear interactions with other modules— makes for 
good software engineering at the knowledge level, 
“structured programming of knowledge bases" if you will. 


DSPL for building a Million Planning Aislitant 

We will describe the use of DSPL (Design Specialists 
and Plans Language) for the design and implementation of 
an expert system in the domain of tactical air mission 
planning. After investigating K NO IIS system from 

MITRE Corporation, an existing mission planning system, 
we developed the Mission Planning Assistant (MPA) using 
DSPL and our generic task approach to building expert 
systems. KNOBS was the primary source of domain 
knowledge for the MPA system. Our project had two 

main goals. First, we wanted to explore the use of DSPL 

for routine planning tasks. Initially, DSPL was developed 
as a result of studying a routine mechanical design task . 11 
It seemed to us, however, that routine planning shares 

many of the characteristics of routine design, suggesting 
that they might require some of the same kinds of 

problem-solving activities. Secondly, we wanted to inves- 
tigate the explanation facilities that are necessary in plan- 
ning systems. We wanted to demonstrate that our generic 
task architectures provide very natural and comprehensive 
frameworks for explanation. 

Tactical mission planning in the Air Force essentially in- 
volves the assignment of resources to various tasks. The 
resources are primarily aircraft and their stores located at 
airbases across the theater of operations. The tasks are 
specified by an ” apportionment” order issued by the Joint 
Task Force Commander to a Tactical Air Control Center 
(TACO). This order describes the overall military objec- 
tives as determined by the Task Force Commander. The 
TACC is responsible for assigning aircraft and personnel 
from specific military units to meet the objectives of the 
apportionment order. The result of these assignments is 
an "Air Tasking Order” (ATO) which summarizes the 
responsibilities of each unit with respect to the day's mis- 
sions. Each mission planned requires attention to such 
details as the selection of aircraft type appropriate to the 
mission, selection of a base from which to fly the mission, 
and coordination with other missions. 

The MPA system we developed currently addresses onlv 
a single type of mission, the Offensive Counter-Air (OCA) 
mission. An OCA mission is an air strike directed specifi- 
cally against an enemy's airbase. Our selection of the 
OCA mission arose primarily because of the availability of 
the KNOBS system and its knowledge base of relevant 
domain facts. Our approach to tactical mission planning 
treats the Air Tasking Order (ATO) as an abstract device 
to be designed. The planning of the missions of the com- 
pleted ATO involves a process similar to the process a 
designer undergoes when faced with a complex mechanical 
device to design. A view of design problem solving should 
illuminate this idea. For a more comprehensive descrip- 
tion of design see 11 . 

Routine Design and DSPL 

The general domain of design is vast: it involves 
creativity, many different problem-solving techniques, and 
many kinds of knowledge. Goals are often poorly 

specified, and may change during the course of problem 
solving. A spectrum of classes of design problems can be 
discerned, varying in complexity from those problems re- 
quiring significant amounts of “creativity", to the most 
routine design problems requiring no creativity at ail. 
The complexity of a design problem will depend on what 
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pieces of knowledge are available to the problem solver 
prior to the start of design, that », the right pieces of 
knowledge can remove the need for creativity and turn a 
complex design task into a routine one. 

What we have called ''Class 3 Design** characterizes a 
form of routine design activity which postulates that 
several distinct types of knowledge are available prior to 
problem solving. First we assume that complete 
knowledge of the component breakdown of the to-be* 
designed device is available to the problem solver, includ- 
ing knowledge of what component attributes need to be 
specified in order to specify a design. The final design 
will consist only of components known in advance, and no 
novel components need to be synthesized. Secondly, we 
assume that knowledge is available in the form of plan 
fragments ({escribing the actions required to design each 
component. A plan for designing a component will typi- 
cally include the designing of subcomponents as steps in 
the plan. Thirdly, we assume that recognition knowledge 
is available that will allow the problem solver to select be- 
tween the alternative plans for designing a component, 
depending on the design requirements and the state of the 
problem solving. The problem solving proceeds by follow- 
ing a top-down process of plan selection and refinement, 
with localized back up and selection of alternative plans 
upon failure of a design plan at any level. While the 
choices at each point tnav be simple, the design process 
overall may be quite complex, and objects of significant 
complexity ran be designed. It appears that a significant 
portion of the everyday activity of practicing designers can 
be analyzed as class 3 design. 

In !)S|*L. a design problem solver consists of a hierarchy 
of cooperating, conceptual specialists, with each specialist 
responsible for a particular portion of the design. 
Specialists higher up in the hierarchy deal with the more 
general aspects of the device being designed, while 
specialists lower in the hierarchy design specific sub- 
portions of the device, or address other design suhtasks. 
Any specialist may access a design data-base (mediated by 
an intelligent data-base assistant). The organization of 

the specialists and the specific content of each one is in- 
tended to precisely capture the human designer’s expertise 
in the problem domain. Kach specialist in the design 
hierarchy contains locally the design knowledge necessary 
to accomplish that portion of the design for which it is 

responsible. There are several types of knowledge 

represented in each specialist, three of which are described 
here. First, explicit design plans in each specialist encode 
sequences of possible actions to successfully complete the 
specialist’s task. Different design plans within a specialist 
may encode alternative action sequences, but all of the 

plans within a particular specialist are always aimed at 
achieving the specific design goals of that specialist. A 
second type of knowledge encoded within specialists is en- 
coded in design plan sponsors. Kach design plan has an 
associated sponsor to determine the appropriateness of the 
plan in the run-tirne context. The third type of planning 
knowledge in a specialist is encoded in design plan selec- 
tors. The function of the selector knowledge is to ex- 
amine the run-time judgments of the design plan sponsors 
and determine which of the design plans within the 
specialist is most appropriate to the current problem con- 
text. 

Control in a USPL system proceeds downwards from the 
top-most specialist in the design hierarchy. Beginning 


with the top-moat specialist, each specialist aelects a 
design plan appropriate to the requirements of the 
problem and the current state of the solution. The 
selected plan is executed by performing the design actions 
specified by the plan. This may include computing and 
assigning specific values to attribute* of the device, check- 
ing constraint* to test the progress of the design, or in- 
voking sub-specialists to accomplish sub-portions of the 
design. Thus a design plan which refers to a sub- 
specialist is refined by passing control to that sub- 
specialist in its turn. DSPL also includes facilities for the 
handling of various types of plan failures, and for con trott- 
ing redesign suggested by such failures. 21 

Mission Planning as Routine Design 

We view tactical mission planning as predominantly a 
routine design task. The problem can be decomposed into 
the design of subcomponents of the mission plan, where 
each component can be designed in a fairly independent 
fashion. The Air Tasking Order is decomposed into 
various missions or groups of missions of known types. 
Each mission or group of missions can be planned rela- 
tively independently of the others, modulo resource conten- 
tion considerations. In both the mission planning and the 
mechanical design domains, each of the solutions to the 
subproblems must be appropriately combined into the 
solution for the problem which they decompose. Due to 
the well known limitations of human problem solving 
capacities, it is apparent that a human problem solver can 
be successful in such a situation only to the extent that 
she can decompose the problem into a manageable number 
of somewhat independent sub-problems, which can be 
solved separately and combined into a final solution. The 
MPA system uses DSPI, as a natural mechanism for 
representing the necessary knowledge. 

The MPA System 

The MI’ A system contains six specialists. The topmost 
specialist, OCA, accepts the mission requirements and ul- 
timately produces the Pinal mission plan. The OCA 
specialist divides its work between two subspecialists, base 
and aircraft. The base specialist is responsible for select- 
ing an appropriate base, while the aircraft specialist selects 
an aircraft type. The aircraft specialist has three sub- 
specialists, one for each of three aircraft types known to 
the MPA system. As needed, one of these specialists will 
select an appropriate configuration for its aircraft type. 

Problem solving begins when the OCA specialist is re- 
quested to plan a mission. Currently, the OCA specialist 
contains only a single design plan which first requests the 
base specialist, to determine a base, and then requests the 
aircraft specialist to determine (and configure) an ap- 
propriate aircraft for the mission. The current has* 
specialist simply selects a base from a list of candidate 
bases geographically near the target. The airrraft 
specialist uses considerations of threat types and weather 
conditions at the target to select an appropriate aircraft 
type and number for the mission. The aircraft specialist 
and its three configuration su Imperialists represent the 
most elaborated domain knowledge in the MPA system. 

Suppose the mission requirements call for a night raid. 
The plan sponsors for both the A- 10 arid F-l would rule 
out the possibility of using these aircraft, since (in our 
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domain model) neither of t hese aircraft have night flying 
capability. The F- 1 1 1 plan sponsor, since it is an all- 
weather filter with night capabilities, would not be ex- 
cluded. The plan sponsor for the F-lll, based on this 
and other considerations (range, ability to carry ap- 
propriate ordinance, target characteristics, etc.) would find 
the F-Ill suitable for the mission. The plan selector in 
the aircraft specialist, finding that two design plans have 
ruled out, would select the “suitable" F-lll design plan, 
and return this information to the specialist. The 
specialist executes the F-lll design plan, which includes 
setting the aircraft type in the mission template to 
“F*lir\ and invoking the F-lll configuration specialist 
which in turn decides an acceptable ordinance load for the 
F-lll for this mission. Once the configuration of the 
aircraft is known, the single aircraft probability of destruc- 
tion in the mission context can be computed. Finally, 
knowing the mission capabilities of each aircraft, the re- 
quired number of aircraft can be determined in order to 
achieve the required probability of destruction, and the 
necessary number of aircraft can be reserved from the 
proper unit. 

Summary 

We have argued the need for high-level task -specific tools 
for constructing know ledge- based problem-solving systems. 
We described our approach, based on the notion of generic 
information processing tasks, and described a toolset which 
can be used to efficiently build expert systems. Kxpert 
systems built according to this methodology have all of 
the ad vantages of control strategies and knowledge 
representations that are especially suited to their infor- 
mation processing tasks. Advantages include knowing 
what kinds of knowledge to collect, and where to put it 
away for use in the problem solver, efficient processing at 
run time, and explanations of system behavior in terms of 
strategies and problem solving goals keyed to the type of 
reasoning. We have illustrated the power of these ideas 
by paying special attention to a high level language called 
l)SPL, and have shown how it was used in the construc- 
tion of a system called MPA. for assisting with mission 
planning in the domain of offensive counter air missions. 
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Reliability is a serious problem in computer controlled robot systems. Although robots serve successfully in 
relatively simple applications such as painting and spot welding, their potential in areas such as automated 
assembly is hampered by programming problems. A program for assembling parts may be logically correct, 
execute correctly on a simulator, and even execute correctly on a robot most of the time, yet still fail 
unexpectedly in the face of real world uncertainties. Recovery from such errors is far more complicated than 
recovery from simple controller errors, since even expected errors can often manifest themselves in 

unexpected ways. ; , 5 

li Uhis oaoe r rn novel approacfWor improving robot reliability. Instead of anticipating errors, w^use 

knowledge-based programming techniques so that the robot can autonomously exploit knowledge about its 
task and environment to detect and recover from failures. \Afe describe preliminary experiment of a systerrf 
that ,we havo designed and constructed in our Robotics Lab. \ 77 _. e . ^ 

^ •rU'-n / ‘ ' i 

1. INTRODUCTION / 3 . 

We want to make robots more dependable so that they can be trusted when left unattended. This paper 
describes the design and development ol a robot system that continues to operate satisfactorily even after it 
encounters a serious error [Gin83b], [Gin85c]. 

Failures in achieving a task are the result of errors, but not every error produces an immediately detectable 
failure. Errors can occur at many levels, at the mechanical ievel (a joint becomes locked.), at the hardware 
level (a sensor does not function properly so that the robot is driven to exceed its joint limits), at the controller 
level, in the computer controlling the robot (either at the hardware or the software level), and in the 
environment. We are mostly interested in errors in the environment because they tend to be more 
unpredictable and difficult to characterize with mathematical models. We are interested in errors in the 
component parts used for the assembly, and in errors in the work cell (loaders, feeders, conveyor belts, 
tools). Our goal is to automatically detect problems caused by collisions, jammed parts, gripper slip, 
misorientation, alignment errors, and missing parts. 

In practice, robot systems that can recover from errors without human intervention do not exist today 
because robot control programs cannot handle the vast range of possible error conditions. It takes 
uncommon skill and experience to develop such a program, and the resulting program will then only apply to 
the specific robot task at hand. Moreover, the program may have to be largely rewritten to handle even a 
minor change in the robot's task (Car85), (Loz83|. 


A difficult problem in automatic error detection and recovery is detecting that something significant has 
occurred. Many events are usually reported to the robot controller but not all of them are significant. The 
same event may be important in some circumstances and almost irrelevant in others. Deciding when 
something is important is the first step in the error detection process. The second step involves detecting the 
cause of the error and its effect on the robot environment. Errors might appear a long time after what caused 
them happened making it more difficult to detect them. Some errors do not affect the execution of the task so 
they could be left unrecovered. Only after the cause for the error has been identified or, at least, after 
alternative plausible causes have been found the recovery activity can start. 


The robot system discussed here is geared towards industrial assembly tasks. The assembly to be 
performed is described in a robotics language. If an error occurs while the robot is performing the task, the 
robot detects the error, and dynamically plans the steps it must take to recover. To do this, the robot applies 
general knowledge about robots, and assembly tasks, plus specific information about the robot and the 
program in question. 
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Our approach simplifies robot programming by putting the burden for general error recovery on the robot 
system itself. The programmer can concentrate on the task at hand and minimize later maintenance if the 
robot recovers from most errors itself. This saves engineering time as well as robot downtime. 

The system described here works in conjunction with an existing robot programming language. We show 
later in the paper how we have used it wiffi animprementattorr oftheAMLHanguagetor the (8M 7565robot In- 
addition to developing the testbed with the IBM robot we have developed prototype components of a 
simulated system. The simulated version is currently more complete than the testbed version since it is much 
easier to control a simulated robot than a real one. 


2. INTELLIGENT ROBOT ERROR RECOVERY 

Our system operates in two phases: offline, and online. Figure 1 illustrates its structure. Each box represents 
a separate program that may run as an independent process. The Preprocessor prepares the assembly task 
for execution by the robot by generating an Augmented Program (AP). The AP Executive interprets the robot 
program and monitors the robot's operation. If a serious error occurs, it activates the Recoverer. The 
Recoverer examines information from the event trace and from the task knowledge base and devises a 
recovery plan. More details on the component of the system are provided later in the paper. 

Since the Preprocessor and the Recoverer both rely on symbolic computational techniques and do not 
require real-time performance they both reside on the same processor, referred to here as the Manager. 
Robot control operations require real-time response and reside on other processors. 

Design Features of the Testbed 

The design of the error recovery testbed incorporates four features of particular interest. First, the 
automated reasoning of the Manager and real-time functions of the AP Executive operate independently and 
execute on separate processors. Second, sensors are activated and monitored selectively according to the 
robot's current action. Third, sensor data are evaluated and assigned qualitative meanings at several levels 



Figure I: Rrror Recovery Syvtrm Components 
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of the system. Fourth, if the robot's task fails repeatedly, the AP Executive will handle successive restarts and 
recoveries without ill effect. 

The present configuration of the error recovery testbed manages an IBM 7565 manufacturing manipulator. 
The components of the error recovery Manager exist on an LMI Lambda processor. Motion control and 
sensor filtering for the IBM 7565 is implemented on an IBM Series 1 minicomputer using the AML 
programming system [Tay82]. The AP Executive resides on an MC68000 based personal computer system, 
an Apple Macintosh. 

The IBM 7565 is well suited for these experiments. It is a cartesian robot moving on linear tracks over a 
rectangular work cell. The gripper has six degrees of freedom and its jaw contains six strain gauges. The 
7565 is provided with the AML robot programming system which can be used to develop relatively 
sophisticated programs. AML provides facilities for monitoring the robot sensors and for performing robot 
motion subject to the presence of appropriate sensor readings. 

Separation ot reasoning and real-time 

The testbed utilizes separate processors for executing the reactive, or real-time, software components and 
for executing the reflective, or symbolic reasoning, components of the system. Providing separate processors 
for the real-time and the automated reasoning components of the system prevents time-critical software 
components from having to compete for computation time. The choice also allows us to choose a managing 
processor for its symbolic computation capabilities rather than its real-time capabilities. Since the AP 
Executive is the only component that interacts with the robot continuously, a large scale system could 
probably share a single Manager among several independent work cells, each with its own AP Executive. 

The Manager initiates a task by transmitting the appropriate Augmented Program (AP) to that cell’s AP 
Executive. In return, the AP Executive transmits the event trace of the task's execution. The physical 
separation of functions makes this information exchange particularly important. The AP must contain all 
information the AP Executive requires to operate the robot in the work cell. The event trace must contain all 
information relating to the task's progress necessary to reconstruct activities that took place in the work cell. 
Whenever feasible and appropriate, the event trace contains specific numerical sensor readings from the 
work cell. 


3. THE PREPROCESSOR 

Our system uses a manipulator level robot programming language to specify the task the robot is to execute. 
This description is given in the AL robot programming language [Gin85b], [Muj79J, though any other 
manipulator level language should work as well. Even though AL is not used in any commercial robot, it has 
many of features many languages have adopted. We have expanded AL to handle descriptions of objects so 
that it can be used to drive our graphic simulation system. We have chosen to use a "robot level language’ to 
describe the task rather than a "task level language’ because robots in real use are programmed with robot 
level languages. Starting from an existing and accepted level of language will allow us to grow to more 
sophisticated languages and yet to keep our ability to experiment with existing commercial robots. 

We assume that the task description is accurate and correct in the sense that a robot simulator would 
execute it reliably. Thus the only errors the system should expect will be introduced by real world 
uncertainties. 

We can't use the original AL program for online monitoring; we rely on an augmented version of the AL 
program to direct the job of monitoring the robot's activities. We call this program Augmented Program (or 
AP). The AP is structured as a finite state machine. The machine is represented with a directed graph in which 
the nodes are arbitrary states. Activities performed by the procedure are specified on the arcs connecting the 
states; you derive the sequence of actions in the procedure by traversing a series of arcs in the diagram. 
Each arc has 'events' or 'preconditions' attached to its activities. If a particular arc is leaving the state the 
system currently is in, then the activities on that arc are performed when the preconditions are satisfied. If two 
or more arcs leave a given state, then the AP Executive chooses the arc whose preconditions are satisfied 
first. This structure is especially useful in systems where several asynchronous events (sensor inputs) seiect 
and trigger subsequent actions. 

This approach lends itself readily to the representation of AL programs. The sequencing of instructions in the 
original AL program is replaced by transitions in the AP. The AP representation also relates explicit sensor 
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data to what the robot is supposed to be doing. Crucial sensor readings preceding some action in the AL 
program will correspond to the preconditions of the corresponding state transition in the AP. The 
preconditions on arcs leading out of a given state will correspond to the set of sensor readings to be 
monitored by the sensor handler. Thus, the set of significant sensor readings will change automatically as the 
robot proceeds through its task. 

Extracting high-level Intentions from an AL program 

Functioning of the AP Executive is highly dependent upon the information derived from the off-line portion of 
the system. In order for the AP Executive to interpret sensory data, the off-line component must provide the 
intent of the AL instructions. For example, if the intent of an instruction "move..” is to transport an object to a 
given location it is important to check for slippage by monitoring the touch sensors. If the intent is merely to 
move the arm, touch sensor data may be irrelevant The off-line component must be able to extract the 
semantics of the program. 

In our system methods from extracting intents of programs are based on syntactic matching and heuristics. 
By syntactic matching we mean identifying sequences of instructions that suggest operations such as 
grasping an object or releasing it. We have explored techniques of this type with excellent results. We can 
already handle difficult cases, such as, for instance, identifying when an object is grasped from inside a hole. 
Additional details can be found in [Gin85a], (Gin85c). 

Figure 2 shows a state transition diagram used in conjunction with syntactic matching to identify intentions of 
AL instructions. Arcs shown in gray indicate transitions that have no clear interpretation and that require user 
intervention. 



Figure 2 : Stale IrasuliuR Oiacram 

Constructing the Expected World Model 

The world model is another critical piece of information for the AP Executive. Suppose that a 'move objecf 
instruction is in execution when the proximity sensors of the hand are activated. Knowledge about the 
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placement of objects in the environment would help determine the cause of the impending collision. In 
addition to placement of all objects in the environment, geometric and non-geometric properties of objects 
will be important. If an object being transported is slippery, the chance of dropping it is increased. If it is quite 
large, chances of a collision may increase, and so on. 

The Preprocessor simulates the execution of the AL program to build an Expected World Model. In our 
current implementation the Expected World Model contains properties of the objects in the environment such 
as their position after the execution of each instruction, when they are moved, wich robot moves them, etc. 

A classical problem with creating a world model offline is caused by branches in the program that create 
alternative models. Our world model has a tree structure in which branching nodes are labelled by the 
condition that controls the branch. The existence of the Expected World Model reduces the amount of 
reasoning during the recovery because we can compare the current with the expected situation to get dues 
about problems. 

Translating an AL Program Into AP. 

The Preprocessor module has to provide information on how errors can be detected and what errors are 
likely for each instruction. For example, a 'move object' instruction might lead to a slippage error with high 
probability. This indicates that finger separation or touch sensors should be checked during the movement. 
Using the intent of the AL instructions, the Preprocessor generates in the AP program the appropriate 
sensory conditions to be checked to guarantee the correct execution of the instruction. 

We need also to know what sensor values are relevant to detect unexpected events. Some of this knowledge 
is obtained from the robot program (such as the position of the robot, or the opening of the fingers), some 
could be obtained from the CAO data base (such as the maximum pressure to exert when grasping an 
object). Some values cannot be known before executing the program. Sometimes the position of an object is 
identified only by sensors. Hence it is important to know that no value is known in advance and that sensor 
data will be used. 

A simple AL program and the corresponding AP are illustrated in Figure 3a and 3b. The task described in the 
AL program is a simple pickup operation. In the AL program WOKH indicates the robot hand and WOKA the 
robot arm. The example shows that we have modified AL to allow for different names of robots and different 

(d ((robot -do open wok 1.5)) 

( (open wok) 2) 

( (hand-error wok) 12)) 

(2 ((robot-do move -wok (-7.77 -14.41 4.4 -45 0 0))) 
((reach wok) 3) 

((hit wok) 12) 

( (joint -error wok) 12)) 

(3 ((imply create obj-block (-7.77 -14.41 4.4 -45 0 0) ) 
(expect grasp wok obj-block) 

(robot-do center wck) ) 

( (center wok) 4) 

( (crush wok) 12) 

( (hand-error wok) 12) ) 

(4 ((imply grasp wok obj-block) 

(expect carry wok cbj-block) 

(robot-do move wck (-7.77 - 14.41 7 -45 0 0 ) ) ) 
((reach wok) 5) 

( (hit wok) 12) 

( (untouch wok) 12) 

( ( joint-error wok) 12)) 


CPEM WOKH TO 1.5; 

MOVE WCKA TO FRAME (ROT (ROLL, -45), 
7E.T0R (-7.77, -14.41, 4.4)); 
'ENTER WOKH; 

MOVE WOKA TO FRAME (ROT (ROLL, -45), 
TSITOR (-7.77, -14.41, 7) ) ; 


Figure 3a: Example AL Program 


(other states in the program ... ) 


(12 ((imply error) (robot-do notice operator)))) 


Figure 3b: Augmented Program derived from Figure 3a 
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ways of expressing rotations. In the corresponding AP states are numbered. Each state contains a collection 
of entries. The first of them describes the action the robot has to do ("robot-do'') and the meaning of the 
instruction in the physical world. In particular, "expect" shows what is expected to happen at the end of the 
state if all goes well, "imply" shows logical deductions about objects or the robot that can be made from the 
intentions extracted during the preprocessing phase. Each other entry specifies a condition to be checked 
using sensor data and the next state if the condition becomes true. Since the Preprocessor generates the 
conditions using knowledge about the intention of each robot action the same AL statement usually 
generates different conditions. As an example we can look at the states 2 and 4 in Figure 3b. 

We should note that there is only one error state in the AP. When an error is detected a transition to the error 
state is generated. We do not use specific error states because we need to check causes of the error to find 
the recovery procedure. We consider the sensor data obtained as symptoms of the error not as its diagnosis. 
Often software that handles failures does not make the distinction between symptoms and errors or it 
assumes that there is a deterministic mapping between symptoms and errors. 


4. THE AP EXECUTIVE 

The AP Executive is responsible for maintaining an accurate picture of what the robot does. The Recoverer 
needs to know the robot's situation but the potential complexity of the robot's activities make it hard to derive 
the necessary details from the program's state. It is easiest for the AP Executive to keep track of what the 
robot is doing and what objects it is manipulating. The AP Executive can then provide the robot's recent 
history and a catalog of objects in the workspace when an error occurs. 

The AP Executive does more than simply observe and report on the robots actions. It takes responsibility for 
issuing commands to move the robot. When the AP says that a robot action is to occur, the AP executive 
sends the command to the robot. The AP Executive tracks the robot's activities by monitoring data from the 
robot's sensors. The sequence of sensor data yields an event trace from which we get the robot's recent 
history. 

The Workspace Model 

To recover from an error, the system needs to know what objects are in the workspace and where the 
objects are. At the time of error it should be easy to find out where the AL program failed and what the values 
of the program's variables are. Unfortunately, we can't deduce the state of the workspace from the state of 
the program. The program just doesn't keep the right kind of information. But it is possible to deduce when 
and how the AL program manipulates objects. To monitor objects in the workspace the AP Executive has to 
be told when an object is acquired, grasped, moved, and discarded. Typically the robot 'acquires' an object 
from a part dispenser, moves it somewhere, and maybe ‘discards' a part by placing it on a conveyor. This is 
sufficient to keep track of what objects are in the workspace and where they are. The workspace model 
update process can then follow objects by monitoring such activities in the event trace. 

The minimum workspace model is a catalog of objects and their locations. Along with the robot's most recent 
activities, this model gives enough information to determine what was going on at the time of an error. The 
Recoverer uses this information to key into more information about the object stored in the offline world 
model. More details about the Workspace modeller can be found in [Smi86aj. 

Sensor management through filtration 

In the error recovery testbed, sensor information is filtered several ways. Initially, the task's AP identifies 
specific sensor information that is significant to the execution of that task. This information is given in terms of 
sensory events specified symbolically that can cause state transitions in the AP. The sensory information is 
used both to identify potential state transition events and to filter sensory information for the event trace. This 
information is also passed to "sensor filter* tasks that activate appropriate sensors and map sensor values 
into events significant to the progress of the robot's task. 

Augmented Programs are instruction sequences structured as finite automatons. AP transitions are caused 
by discrete events, so a robot's progress at its task depends on the occurrence of events that cause 
appropriate transitions. Significant sensory readings must be mapped into events that cause state transitions. 
This mapping provides one form of sensor/ filtration: sensory readings are reported only when the value is 
significant to the progress of the robot's task. In some cases the identity of the event is the only specific 
sensory information returned and in other cases numerical data is included in the event trace as well. 
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Each AP state contains event predicates identifying sensor readings that would be significant to the 
successful execution of that state. The AP Executive passes the information in the event predicates to the 
appropriate sensor filters before initiating robot motion. The sensor filters activate 
app 

ropriate procedures so that necessary sensor readings will take place. The AP Executive omits all sensor 
information from the event trace that is not identified in the AP as being significant. 

Grip sensor filtering on the IBM 7565 is implemented using "monitor" facility of the AML language [Tay82j. 
Monitors are used to define ranges of sensor values that can activate user-defined procedures or terminate 
robot motions. When initializing the IBM 7565, the AP Executive defines a set of monitors for classifying 
gripping forces and associates each monitor with an AP event type. The numerical values used for classifying 
gripping force depend on the objects being used in the robot's task and the actions performed on them, so 
these values may be adjusted during initialization. 

When the AP Executive gives the IBM 7565 a motion command, it also specifies a set of monitors to activate. 
The AML system collects the appropriate sensor readings for each active monitor and "trips" the 
appropriate monitor if its sensor enters the monitor's defined range. This terminates the motion in progress 
and generates a message to the AP Executive identifying the qualitative value of the sensor reading, as 
determined from the monitor that was tripped. If no monitor terminates the active motion, a similar message 
indicating uninterrupted completion is sent instead. The AP Executive then generates a sensor event and, if 
necessary, updates the event trace and performs an AP state transition. 

Since the IBM 7565 is normally programmed in the AML language, the AP Executive translates AP 
commands and sensor specifications into an AML compatible form. Since AML is a manipulator level 
language, the mapping of robot actions from the AP form is straightforward. Mapping sensor operations is 
more complex since APs specify symbolic sensor readings. Figure 4 shows an example of the transformation 
of a "move" operation in an AP state into the corresponding AML commands executed by the robot. The 
desired destination and the desired AML monitoring sets to be activated (E_HIT and E_UNTOUCH) are 
passed to the APM procedure. This procedure, written in AML and executing on the IBM series 1 , performs 
the MOVE operation and the related filtering for the 7565's sensors. The procedure activates the appropriate 
monitors and performs the motion subject to the selected monitors. 

Qualitative sensor interpretation 

Although the event trace often provides numerical sensor data, such information is not of primary importance 
when reasoning about the robot's activities. To meet this need, the error recovery system assigns symbolic 
meanings to numerical sensor values in a number of ways. Spatial locations and critical dimensions are 
assigned symbolic names. Gripping forces are assigned qualitative values according to the range in which a 
force value falls. 

Qualitative classification of sensor data often serves a second purpose as well. When executing an AP, the 
AP Executive responds to events in terms of symbolic classifications. Upon successful completion of a motion 
command the AP responds to a "reach" sensor event instead of examining and matching the robot's reported 
destination. If the gripper drops an object and the gripping force drops to a small value, the AP responds to an 
"untouch" sensor event instead of testing the specific force value. The classification of sensor values into 
different types of AP events is performed by a sensor filter procedure that operates on the behalf of the AP 
Executive, as described above. 

Ideally, qualitative classification of sensor data should be performed by a component of the Manager and 
exploit its increased computational capabilities and knowledge bases. Symbolic classification of spatial 
information is an example of this. The robot's sensor filter identifies whether successful completion of a robot 
motion caused the robot to reach its sensed position, but the filter does not try to identify the location in terms 
of the robot task's overall goals. Identification of the particular location is handled by the Manager's work cell 
modeller. 

When an error occurs the Manager produces a model of the robot work cell in terms of its probable state and 
its intended state. Locations visited by the robot or by objects in the work cell are assigned symbolic 
identifiers, and a history is produced of visits for each location, object, and robot gripper in the work cell. Error 
recovery planning consists of producing a sequence of robot actions to change the work cell from its 
erroneous state to its intended state. 
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0 INCREASING GRIP FORCE 



touch grasp crush 


Figure 5: Qualitative interpretation of grasping forces 


Figure 4: Converting from AP statements to AML statements 


Qualitative interpretation of gripper forces, on the other hand, must be performed by a sensor filter. Gripper 
forces, when they are significant, determine whether the gripper is touching an object and holding with an 
adequate force. Identification of appropriate touching and grasping forces must be communicated to the AP 
Executive so that appropriate AP state transitions occur depending on the gripping forces encountered. The 
sensor filter classifies gripping forces into specific ranges according to the robot's current action. Each range 
corresponds to a type of sensing event that can be produced by the gripping force sensor. Figure 5 shows an 
example of that. 


5. THE RECOVERER 

We are interested in discovering causes for errors and errors might appear a long time after what caused 
them happened [Smi86b]. Error interpretation becomes more difficult as the complexity of the task increases. 
For example, consider a task where a robot moves cubes from a feeder to a shipping pallet, twelve at a time. 
What might happen if the the cube falls from the gripper and lands on the pallet, knocking another object off? 
Most of the failure reason models available only apply to the objects and situations directly related to the 
sensor reading indicating the error. The robot thus only associates an error with a part if it uses its sensors on 
the part and finds an error. The lost part won't be missed until someone down the line tries to unload the 
pallet and finds it one part short. 

The symbolic model ol the work cell constructed before execution of the task and the trace of events are 
used here. When something unexpected happens we can trace the error back until we find critical 
measurements or assumptions made that were not supported by sensor data. For instance, in general we 
assume that if something is left in a stable configuration it will remain there until a fact appears that shows the 
configuration has changed. If we discover later that the object moved it means that something happened to 
move it that was not explicitly noted. 

We have developed methods for symbolic tracking of objects from event traces. Common sense heuristics 
are used for symbolic tracking. For instance, if the robot is holding an object and the robot moves, then we 
know that the object moves to the same place. Tracking an object in real time with sensors is too expensive, 
unless we know where to look for it. and how it looks like. Symbolic tracking is less expensive from a 
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computational point of view, and helps in reducing the number of possible causes for errors. After that 
number has been reduced we can get additional sensor data to guarantee that the correct cause for the error 
has been identified. 

It is curious to observe that most of the Al work in planning has concentrated on checking preconditions 
before executing every action to guarantee that they are satisfied in the current state of the world. 
Postconditions are not checked, but are used solely to update the world model. So an error can go 
unchecked for a while and can be detected only when it affects the preconditions of another action. We check 
selected conditions after the execution of each instruction to guarantee that failures are identified as soon as 
they appear. This still does not solve the problem of errors caused by the robot during the movement that 
require different sensors to be checked. For instance, if the arm bumps into objects during a transfer motion 
without losing the part it is carrying no error will be reported. This requires failure reason analysis when an 
error is found. 

Once an error has been detected, the recovery process can start using a trace of relevant events and 
whatever information is available about the task to determine the causes and effects of the error. It is only 
after the cause for the error has been identified or, at least, after alternative plausible causes has been found 
the recovery process can start. 

To be more specific, if an AP state transition leads to an error state, a message to that effect is 
appended to the event trace and the trace is passed to the Recoverer component of the Manager. The 
Recoverer generates a model of the current work cell's state and of its desired state. This model is used to 
produce a recovery plan in the form ot AP states to be appended to the task's existing AP. The Manager 
passes these additional states back to the AP Executive where they are executed. If the new states each 
execute successfully, they will lead the task back to a state in the original AP. 

To successfully effect recoveries in this manner, the Recoverer requires a copy of the task's AP and the 
information in the event trace. The Recoverer can also exploit knowledge about the robot's task, the parts 
involved, and the work cell to produce the recovery plan. To simplify experiments with error recovery as well 
as for improved performance in industrial situations, the testbed's AP Executive can handle repeated failures 
and subsequent recoveries by a robot task. The AP Executive can also display messages on the AP 
Executive's display screen for explaining error diagnoses or for instructing the robot's operator. 

During normal execution, the AP Executive contains a copy of the robot task's AP. if an error recovery occurs, 
the Recoverer passes additional AP states to the AP Executive. These additional states do not replace 
existing states in the AP; they are appended to them. To recover, the AP Executive resumes task execution 
with the first of recovery states passed to it. Once the recovery execution begins, the AP Executive treats the 
recovery states identically to the states in the original AP. If another failure occurs, whether during the 
recovery or after completing the recovery, the AP Executive again reports the failure to the Recoverer and 
resumes executiun when it receives a set of recovery states. 

For example, if the robot loses a part, it can attempt a recovery by opening the gripper, moving to the work 
cell surface, and trying to grab the part. If the part is there, the recovery can proceed. If the grasp fails, the AP 
Executive simply informs the Recoverer which can then produce another recovery plan and try again. 

The ability to do multiple recoveries allows the Recoverer to profit from mistakes in a recovery plan. When 
faced with multiple recovery choices, the Recoverer can choose the one that is most likely to reduce 
uncertainty about the state of the work cell. The Recoverer can also produce recovery plans with the sole 
purpose of taking sensor readings in the robot work cell. If the Recoverer needs to probe a specific spatial , 
location it can produce a recovery plan that performs the desired sensor reading and then immediately fails. 
The resulting event traces will increase the amount of information in the work cell model and the unsuccessful 
recovery will not prevent a subsequent recovery from being attempted. 

Another useful feature during error recovery is the AP Executive's ability to display messages for the robot's 
operator. These messages are produced by statements in the AP and thus may be generated by the 
Recoverer. This facility allows the Recoverer to request specific operator intervention when necessary. This 
permits experiments with failures that tax the available sensory or reasoning facilities. In the Testbed it also 
allows experiments with primitive Recoverer software that simply diagnoses the problem and asks for 
operator intervention. This capability may also have worthwhile industrial applications: the Recoverer could 
produce messages to guide the robot’s operator in manually correcting problems in a complex and unfamiliar 
assembly. An example is shown in Figure 6. 
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Figure 6: Operator display by AP Executive 


6. RELATED WORK AND SUMMARY 

Our approach differs from other approaches in significant aspects. The current state of the art in industrial 
robots is that either the robot executes its task regardless of its success or it quits every time it encounters 
something unexpected [Luh83]. A better approach is to handle the situation by preprogramming error 
checking and error recovery procedures for every probable error [Bon82], [Gin83a], [Gin85b], [Tay82] This 
is an expensive method both in engineering and in robot computational resources. It : also easy to forget 
some important checks. 

Since it is difficult to consider all possible errors, many of which might never happen, another method is to 
generate from the task level description a program that is guaranteed to be correctly executed even in the 
presence of uncertainties in the environment. This requires models of robot kinematics and dynamics, and 
models of physical properties of objects such as friction. This approach has been applied only to fine motions 
for specific tasks such as insertion operations (Loz84]. Modeling uncertainties [Bro82] and taking into account 
errors in the model [Don86] helps but the real world is so complex that it might not be worth developing 
sophisticated models of it. 

Much previous research in Artificial Intelligence has centered on detection and correction of errors in 
simulated robot systems [Wil84]. These studies all make a number of assumptions: knowledge about events 
is correct, each action produces precisely defined postconditions, there are no uncertain data, correct 
predicates are generated from sensor data every time they are needed, and sufficient knowledge is provided 
to take into account all the possible states of the environment. These assumptions are too strict to be realistic. 

A few exceptions exist. The most notable is STRIPS [Fik71] the system used to control the mobile robot 
Shakey. Srinivas [Sri76] [Fri77] has designed a system for analyzing the causes of failures in robot programs 
and for replanning the robot activity. More recently, work has been done on monitoring the execution of 
programs with real robots [Lee83], [Lop86], There is a growing interest in modeling sensors [Fox83], [Hen84], 
and planning for their use [Doy86j that will provide needed background for work on error detection and 
recovery. 
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Our approach is more similar to the way people handle errors and unexpected events. By relating events to 
general knowledge human beings can identify unexpected situations and by applying common sense and 
domain specific knowledge they can find solutions to situations never seen before. The key to human 
performance is in the knowledge about the environment and about the specific task at hand. We want to do 
something similar for assembly robots. Since the domain is limited and reasonably constrained the amount of 
knowledge needed can be managed by using present technology [CAR84]. 
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Abstract- 
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In real-world domains (e.g., a mobile robot environment), things do not always proceed u planned, so it is important to develop better execution-monitoring 
techniques and replanning capabilities. This p aper d essei b es jhese capabilities in the SIPE planning system/The motivation behind SIPE is to place enough 
limitations on the representation so that planning can be done efficiently, while retaining sufficient power to still be useful. This work assumes that new* 
information given to the execution monitor is in the form of predicates, thus avoiding the difficult problem of how to generate these predicates from 
information provided by sensors. 

The replanning module presented here takes advantage of the rich structure of SIPE plans and is intimately connected with the planner, which can be 
called as a subroutine. This allows the use of SIPE's capabilities to determine efficiently how unexpected events affect the plan being executed and, in many 
cases, to retain most of the original plan by making changes in it to avoid problems caused by these unexpected events. SIPE^ also capable of shortening 
the original plan when serendipitous events occur. A general set of replanning actions is presented along with a general replanning capability that has been 
implemented by using these actions. 


1 Introduction 

A principal goal of our research in planning and plan execution is the development of a domain-independent, heuristic system that can plan an activity 
and then monitor the execution of that plan. Over the last two years we have designed and implemented such a system, called SIPE (System for 
Interactive Planning and Execution Monitoring). 1 The basic approach to planning is to work within the hierarchical-planning paradigm, representing plans 
in procedural networks - as has been done in NOAH |6| and other systems. Several extensions of previous planning systems have been implemented, 
including the development of a perspicuous formalism for describing operators and objects, the use of constraints for the partial description of objects, 
the creation of mechanisms that permit concurrent exploration of alternative plans, the incorporation of heuristics for reasoning about resourcse, and the 
creation of mechanisms that make it possible to perform deductions. 

Given a description of the world and a set of operators that it can apply, SIPE can generate a plan to achieve a goal in the given world. (Operators are the 
system's description of actions that it may perform.) However, in real-world domains, things do not always proceed as planned. Therefore, it is desirable 
to develop better execution- monitoring techniques and better capabilities to replan when things do not go as expected. In complex domains it becomes 
Increasingly important to use as much as possible of the old plan, rather than to start all over when things go wrong. 

This paper describes the execution-monitoring end replanning abilities that have recently been incorporated into the SIPE system. The particular advantages 
that can be obtained by using the rich structure in our plan representation are shown, as well as more general problems. The environment of a mobile 
robot has been used as a motivating domain in the development of some of the abilities here, though the implementation has been corned out in a general, 
domain-independent manner. This document does not describe resources, constraints, plan generation, and otbv features of SIPE, nor does it attempt to 
justify the basic assumptions underlying the system. The interested reader is referred to (10] for this. 

The problem we are addressing is the following: given a plan, a world description, and some appropriate description of an unanticipated situation that 
occurs during execution of the plan, our task is to transform the plan, retaining as much of the old plan as is reasonable, into one that will still accomplish 
the original goal from the current situation. This process can be divided into four steps: (t) discovering or inputing information about the current situation; 
(2) determining the problems this causes in the plan, if any, (similarly, determining shortcuts that could be taken in the plan after unexpected bat helpful 
events); (3) creating “fixes* that change the old plan, possibly by deleting part of it and inserting some newly created subplan; and (4) determining whether 
any changes effected by such fixes will conflict with remaining parts of the old plan. Steps 2 and 4, and possibly 3 as well, involve determining which aspects 
of a situation later parts of the plan depend upon. Part this problem ia an instance of the standard truth maintenance problem, and SIPE's solution is 
described in Section 4. In SIPE, Step 4 becomes part of Step 3, is only those fixes that are guaranteed to work are produced. In addition, serendipitous 
effects are used to shorten the original plan in certain cases. 

The major contributions of the replanning module in SIPE result from taking advantage of the system's rich plan representation and from imbedding it 
within the planning system itself, rather than implementing it as an independent module. This provides a number of benefits, of which the most important 
follow: (1) the replanning module can exploit the efficient frame reasoning mechanisms in SIPE to discover problems and potential fixes quickly; (2) the 
deductive capabilities of SIPE are used to provide a reasonable solution to the truth maintenance problem described above; and (3) the planner con be 
called as a subroutine to solve problems after the replanning module haa inserted new goals into the plan. 

Another important contribution is the development of a general replanning capability (see Section 6) that has been implemented by using a general set of 
replanning actions. In general, recovery from an arbitrary error poses a difficult problem. Often very little of the existing plan can be reused. One can 
always foil bock on solving the original problem in the new situation, ignoring the plan that was being executed. The re; anning port of SIPE, however. 

‘The research reported here is aupported by Air Fort# Office of Scientific Research Contract F49420-79-C-018S. 
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Figure 1: Control and Data Flow in SIPE Modules 

tries to change the old plan, whila retaining a a much of it a* possible. Sinca tha problam is so difficult, ona would not axpact vary impraaaiva performance 
from a ganaral rtplannar ancb u SIPE’s. 

Bat Ur parformanca raqniraa domain -specific information for daalinf with errors. In many domains, tha typos of arron that ara commonly sncouatersd can 
ba predictad (e.g., tha robot arm dropping something it was holding, or missing somathing it was trying to grasp). For thin reason, tha ganaral raplaanai 
is based on a number of general replanning actions (i.e., actions that modify a plan in ways that ara useful for handling unexpected situations) that can be 
referred to in a language for providing domain-specific error recovery instructions. Section 0 gives tha outline of such a language. 


1.1 Assumptions 

SIPE assumes that information provided about unexpected event* is correct and, to a certain extent, complete. This assumption avoids many of the hardest 
problems involved in getting a planner auch as SIPE to control a mobile robot. The challenging task of determining how to generate correct predkatei 
from information provided by the sensors is not addressed. We expect the translation of the information from the robot’s sensors (s.g., ths pixels from the 
camera or the range information from ultrasound) into ths higher-level predicates used by the planner to be crucial in applying a SIPE-like plaansr to i 
mobile robot. We hope to deal with this problem in ths nsar future. 

In a mobile robot domain, it may often be important to expend considerable effort in checking for things that might have gone wrong beeidee the unexpected 
occurrence already noticed. There is a substantial tradeoff involved here, as interpreting the visual input of unanticipated scenes may be expenesve. TV 
research described in this paper does not examine this problem either. It assumes that nothing has gone wrong besides reported errors and effects tha car 
be deduced from them. The problem of uncertain or unreliable sensors or information is also largely unaddressed, except that some predicates a md variable 
may be apecified as unknown. What is discussed here is what to do with new information in the form of predicates (if we assume tha euch predicates haw 
somehow been discovered). Replanning appropriately with such information is an essential pvt of u« overall solution. 


1.2 Overview 

Figure 1 shows the various modules in the SIPE execution- monitoring system. The solid arrows show which modules call which other*. The broke* snrowi 
■how the flow of data and informatioa through the ayetem ae it replane for an unexpected situation. These arrows are labeled with a description of th» 
data being paseed. 

The general replanner is givsn the list of problsms found by ths problem rocog niser and trie* certain replanning action* in variou* case*, but will mot ahrsyi 
find a solution. The general replanner changes the plan bo tha it will look like an unsolved problem to the standard planner in SIPE (e.g. t by insert ia< 
new goals). After the replanner has dealt with all the problems that were found, the planner is called on the plan (which now include* unsolved goals). 1 
it produces * new plan, this new plan should solve correctly all the problems tha were found. 

Section 2 of this paper describes ths fsaures of plan repre s s ntaion in SIPE that are relevant to its replanning capabilities. To describe uxexpectsc 
situations, a user (at present a human, but eventually this may be a program controlling and interpreting the robot’s sensors) can enter arbitrary predicate 
at any point in tbs exsention or can specify certain things as unknown. Section 3 describee the details of this process. Ones the description of the enexpectec 
aituation has been accumulated, the execution monitor calls a problem recogniser described in Section 4, which returns a List of all ths problem* it detect 
in the plan. The replanning actions ve described in Section 5 and the general replanner in Section fl. Section 7 shows examples of the general replanae 
in operation. 


2 Plans in SIPii 


Plans in Sli E ve represented as procedural networks |6], with temporal information encoded in the predecessor and successor links between sodes 
The plan rationale, of primary importance to the execution monitor, is encoded in the network by M A INSTEP links between nodes and by the use o 
PRECONDITION nodes (described below). MAINSTEP links describe how long each condition that has been achieved must be maintained. A contex 
must also be given to specify a plan completely, as the network contains choice points from which alternative plans branch. The typ— T.ndzz thii 
In plans ve described below to the extent necessary for understanding the execution-monitoring capabilities. 

SPLIT and JOIN nodes provide for pvallel actions. SPLITt have multiple successors and JOINs have multiple predecessors so that partially ordered plaa 
can be produced. JOIN nodes have a parotid- postcondition slot, which specifies the predicates that must all be true in the situation represented by th 
JOIN node. If a JOIN node originally has N predecessors, there will be N conjunctions of predicates that must all be true at the JOIN node. (Son* 
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(a) Plant at Different Levels 



(b) Wedge* Used by the Execution Monitor 
Figure 2: SIPE Plan Viewed from Different Perspective* 


branches may have been linearised, eo there may be fewer than N predecceeon after planning.) It it easier to record this at the JOIN node (than by having 
previous node* point to the JOIN as their purpose), since a failed parallel postcondition can more easily be retried during execution monitoring if there is 
easy access to all parallel postcondition*. The parallel postcondition slot is filled only when the JOIN is first introduced into the plan; it is not updated u 
more detailed levels of the hierarchy are expanded. As long as the highest level predicates are as desired, it is assumed that the lower-level predicates are 
irrelevant. 

COND, ENDCOND, and CONDPATTERN nodes implement conditional plans. COND and ENDCOND are similar to SPLIT and JOIN, but each successor 
of ths COND begins with a CONDPATTERN nods that determines which successor will be executed. 

CHOICE nodes denote branching points in the search space. They have multiple succeesors, but the context selects ons of these as being in the curat 
plan. Constraint* on variables may be posted relative to this choics point. Thus, if the part of a plan after a CHOICE node is removed, the corresponding 
choice point in the context should also be removed so that constraints that are no longer valid will be ignored. 

GOAL nodes do not occur in final plans, since they represent problems that have not yet been solved. A GOAL node specifies a predicate that must be 
achieved, bat which is not tree in the situation represented by its location in the procedural network. Replanning actions will insert GOAL nodes in the 
plan. Each GOAL node has a MAINSTEP slot, which denotes a point later in the plan that depends on the GOAL. (This describes the rationale for baring 
the GOAL in the plan.) Each goal must be maintained as trus until the node which is its MAINSTEP is executed. A MAINSTEP slot can have ths atom 
PURPOSE as its value, denoting that the given predicate is the main purpose of the plan, not preparation for some later action. 

PHANTOM nodes are similar to GOAL nodes except that they are already true in the situation repieseutvd by their location in the procedural network. 
They are part of the plan because their truth must be monitored as the plan is being executed. They also contain MAINSTEP slots. 

PROCESS nodes represent actions to be performed during execution of the plan; they also have MAINSTEP slots, as do PHANTOM and GOAL nodes. 
In a final plan, all PROCESS nodes will denote primitive actions. (There are also CHOICEPROCESS nodes, which are like PROCESS nodes except that 
they have a list of actions, one of which must be performed.) 

PRECONDITION nodee provide a list of predicates that must be true in the situation represented by their location in the procedural network. Operators 
may specify preconditions that must obtain in the world stats before the operator can be applied. The concept of precondition here differs front its 
counterpart in some planners, since the system will make no effort to render the precondition true. A false precondition simply means that the operator is 
not appropriate. Conditions that the planner should make true (and therefore backward-chain on) can be expressed as goal or process nodes. 

By distinguishing between PRECONDITIONS, GOALS and PROCESSES, we effectively encode metaknowledge about how to achieve goals. SIPE will use 
any means to solve a goal node, only the operators listed to solve a process node, but no operators to solve a PRECONDITION node. Thus, a precondition's 
becoming false does not mean that it should be made into a goal; rather it means that the part of the plan produced by the operator which initially innerted 
this precondition is invalid. PRECONDITION nodes also hslp encode the rationale of a plan, since in effect they mean that the part of the plan associated 
with them (see below) was produced on the assumption that the predicates in the pr* .adit ion were true. 

In addition to the *horisontal* MAINSTEP, predecessor, and successor links within on* level of a plan, there are "vertical” jinks between different levels 
of the hierarchy. EUch node that is expanded by the application of an operator has descendant links to each node so produce I T he descendant nodes in 
turn have ancestor links back to the original nod* one level higher in the hierarchy. Starting with a node that was expanded '<v an operator application, 
a wedge of the plan is determined by following all its descendant links (in the current context) repeatedly (i.e., including descendants of descendants, a ad 
so on) to the lowest level. (This definition of wedges is the same as that used by Sacerdoti [6].) Figure 2 depicts this graphically, with the largs boxes in 
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Pat- (b) representing wedges. The node ori finally expanded by an operator application i« called the top ot the wedge. A wedge with its top at a high level 
in the hierarchy will generally contain many lower* level wedgee within itself. The only nodes that can be the tops of wedges are GOAL, PROCESS, and 
CHOICEPROCESS. 

Since PRECONDITION nodes are created only when an operator is applied, the part of a plan associated with a PRECONDITION node can be fouad by 
ascending along the ancestor linhs to the point at which the precondition first became part of the plan (once inserted, PRECONDITION nodes are copied 
down from level to level). The node that was expanded by an operator to create this precondition is one level higher than where the first PRECONDITION 
node appears and is the top of the wedge associated with each of the PRECONDITION nodes that are copied from the first one. 


3 The Input of Unexpected Situations 

Daring execution of a plan In SIPE, some person or computer system monitoring the execution can specify what actions havs been performed sad what 
chan gee have occurred in the domain being modeled. SIPS changes its origin*! world mode! permanently so as to show the effects of actions already 
performed. At any point during execution, the system will accept two types of information about the domain: (1) an arbitrary predicate whose arguments 
are ground instances, that is now true, false or unknown; and ( 2 ) a local variable asms that is now unknown. SIPE first checks whether the truth-values 
for the new predicates differ from its expectations and, if they do, it applies it* deductive operators to dedu ce more changed predicates. 

It is important to note that the input Ug of predicates does not solve the "pixels to predicates" problem, which is ths crucial issue in using a planner such 
as SIPE to control ths actions of a robot. This problem involves translating the input vf the robot's sensors (s.g., the pixels from the camera or the range 
information from ultrasound) into the higher-level predicates used by the planner. The research described here is concerned with what must be done * <th 
the predicates once they have been established but does not tabs up the question of how to determine them automatically. We hope to address this la" r 
task in the near future. 


3.1 Unknowns 

Unhnowne are a new addition to SIPE, as it previously assumed complete knowledge of the world. Having unknown quantities constitutes a fundamental 
modification because even the method of determining whether a predicate is true must be changed. If the truth-values of critical predicates art unknown, 
the planner will quickly fail, since none of the operators will be applicable. (Neither a negated nor an unnegated predicate in a precondition will match 
an unknown one.) Operators can requirs predicates to bs unknown as part of their precondition in case there are appropriate actions to take when things 
are uncertain. Conditional plans have also been implemented as part of the execution-monitoring package in SIPE; thus, an operator might produce a plan 
with an action to perceive the unknown value, followed by a conditional plan that specifies the correct course of action for each poeeible outcome of the 
perception action. The deductive capabilities have also been enhanced so that operators can deduce that something is unknown. 

The ability to specify variables as unknown is simply a tool provided by the system that will presumably be useful in some domains, particularly in a mobile 
robot domain. Ths idea behind this tool is that ths location of an object may become unknown during execution. Rather than make predicates unknown, 
which may cause the application of operators to fail, we simply say that the variable repreeenting the location is instantiated to the atom UNKNOWN, 
rather than to its original location. All predicates with this variable as an argument may then still match as if they wer* true. Thus, the syeUm can 
continue planning as if the location were known. The only restriction is that no action can b# executed that uses an unknown variable as an argument. 
When such an action is to be executed (e.g., go to LOCATIONl), then the actual instantiation of tbs variable must be determined before tbs action is 
executed (possibly through a perception action). Note that it would be incorrect to continue planning if the trath-valuee of important predicates depended 
on the instantiation of the location variable. It is the responsibility of the user not to use the unknown variable if predicates depend on the latter'e value. 


3.3 Interpreting the input 

The user need not report all predicates that have changed since many of these may be deduced by SIPE’s deductive operators. Ths system's deductive 
power has been increased recently (see next section) so many effects can be deduced from certain critical predicates. SIPE does not check for additional 
unexpected predicates. Alternatively, we could decide on some basis (which would have to be provided as part of the domain-specific description) just how 
much effort to expend on perception actions to find out whether more than the minimum hae gone wrong. For example, if we are told that (ON A B) is 
not true when w« expected it to be, we might want to check to see if B is where we thought it was. As it is, SIPE will simply deduce that B is clear (if no 
other block is on B) and will not try to execute actions to make further checks with regard to ike world. This latter procedure could be very »p«aiiv« for 
a mobile robot in the absence of good domain- specific knowledge about what was worth checking. 

There is a problem with unexpected effects in deciding how they interact with the effects of the action that was currently being executed (e.g., did they 
happen before, during, or after the expected effects?). Our solution to this problem is to assume that the action took place as expected and to simply insert 
a "Mother Nature* action after it that is presumed to bring about the unexpected effects (and thinge deduced from them). The system assumes that any 
effects of the action being executed that did not actually become true are either provided or can be deduced from the information provided. This solution 
interfaces cleanly and elegantly with the rest of the planner and avoids having to model the way in which the unexpected effects might interact with their 
expected counterparts. 


4 . Finding Problems in a Plan 

Having just inserted a MOTHER-NATURE node (MN node) in a plan being executed, SIPE must now determine how the effects of this node influence the 
remainder of the plan. There are two aspects to this: the Erst involves planning decisions that were based on the effects of this node, and the second involves 
deductions about the state of the world that were based on those effects. Section 4.1 describes the problem recogniser in SIPE, which finds all problems in 
the remainder of the plan that might be caused by the effects of the MN node. Because of the rich information content in the plan representation (including 
the plan rationale), there are only eix problems that must be checked. As shown in Figure 1, the problems found by the problem recogniser are given to 
the general replanncr. The problem recogniser also notices possible serendipitous effects. 

The second aspect mentioned above involves solving the traditional truth maintenance problem. Many effects deduced later in the plan may no longer 
be true if they depended on predicates that art negated by the MN node. The validity of such deductions must be checked so tbat the remainder of the 
plan represents ;h« state of the world accurately. Section 4.2 describes how SIPE solves this problem, correctly updating deductions later in the plan. 
Deductions that are changed may or may not cause problems that should be recognised by the problem recogniser. If such problems are generated, they 
will be found by the problem recogniser described in Section 4.1, since the deductions are correctly updated before the problem recogniser is called. 
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Figure 3: Block* World ProbUm and Plan 


4.1 Problems found by the problem recognizer 

All occurrence* of the aix problem* listed below are found by the problem recogniser. Theee problem* constitute the only things that can go wrong with a 
plan in SIPE after addition of a MN node at the current execution point. The blockt-world problem in Figure 3 will be used to show an example of each 
type of problem. 

1 - Purpose not achieved. If the MN node negates any of the main effect* of the action just executed, there i* a problem. The main effect* must be 
reachieved. If during execution of the first PUTON node in the plan in Figure 3, either -«(ON B C) or (ON B D) is given as an unexpected effect, then the 
MN node inserted after the PUTON node will negate the purpose of the PUTON node - thereby resulting in an instance of this type of problem. 

7 - Previous phantom* not maintained. SIPE hasp* a list of phantom nodas that occur before the current execution point (including those on parallel 
branches), and whose MAINSTEP slot specifies a point in the plan that has not yet been executed. Theee are phantoms that must be maintained. If the 
MN node negates any of these, then there is a problem. The phantoms that are no longer true must be reachieved. Suppose that during execution of the 
first PICKUP node in the plan In Figure 3, ^(CLEAR C) is given as an unexpected effect. This type of problem will than occur, since the phantom node 
(CLEAR C) has a MAINSTEP slot (not shown in the figure) pointing to the first PUTON node, but ha* been negated by the MN node after the first 
PICKUP nod*. 

3 - Process nod* using unknown venial/* a* argument. If a variable has baan declared a* unknown, than th* first action using it is an argument must 
be prveedvd by a perception action for determining the value of the variable (see Section 3). If tbs B in the plan were the instantiation of the variable 
BLOCK 1 (instead of being given as part of the problem), and UNKNOWN BLOCK 1 were entered during execution of the first PICKUP action, then this 
type of problem would occur with the immediately following PUTON action, since it would be applied to an UNKNOWN argument. 

4 - Future phantoms no longer tru*. A phantom nod* after tha current execution point may no longer be true. It must be changed to a GOAL nod* so 
that the planner will try to achieve it. In th* sample plan, suppose that (ON D B) were given as an effect during execution of the first PUTON node. This 
type of problem would then occur with the last (CLEAR B) phantom node in the plan, since it would no longer be true when it is expected to be. 

5 - Future precondition no longer true. A PRECONDITION node after the current execution point may no longer be true. In this case, we do aot want to 
r* achieve it, but rather pop up the hiera*xhy and perform some alternative action to achieve the goal at that level of tha hierarchy. Because the sample plan 
contains no PRECONDITION nodes, we consider an example of this type in the travel planning domain. Suppose there ia an operator for John’s taking a 
taxi to tha airport, which has a precondition that John’s car is inoperative. If, daring execution of the first pert of th* plan, SIPE ia told that John's car 
is not broken, tbii type of problem will occur. In this case th* reason for taking a taxi to tha airport has bean invalidated, and the general replanner will 
pop up th* hierarchy and apply a different operator to get John to the airport (presumably driving his car). 

6 - ParniUl postcondition not true. All the parallel poetconditions may no longer be true at a JOIN node. (This could be bandied by maintaining phantoms, 
but is more convenient to handle separately.) In this case, we must insert a set of parallel goals after the JOIN, one for each untrue parallel postcondition. 
The parallel poetconditions of th* new JOIN will be the same as those on the old JOIN. In the sample plan, th* last JOIN node will have both (ON A B) 
and (ON BC] u parallel postconditions (sine# they were in parallel originally). Suppose that (ON B TABLE) were given as an effect during th* execution 
of the last PUTON node in the plan. This type of problem would then occur, since the parallel postcondition of (ON B C) would no longer be true. 

Because of '.he way plans are encoded ia SIPE, these are the only things that need to be checked when determining whether an MN node affects tb* 
remainder of a plan. This illustrates how tha rich structure of plans in SIPE helps produce efficient problem detection. It should be noted, however, that 
processes (actions) are assumed to work whenever their precondition is true and when all phantoms whose MAINSTEP slot points to the process art true. 
(All such necessary conditions should be encoded as either preconditions or goals, in any case.) There is currently no check for loops caused by the same 
error happening repeatedly, with th* same fix being proposed by the general replanncr each time. Various simple checks could easily be added if this were 
a problem. 

Finally, there are two important points to note with regard to the problem recogniser. First, in addition to the^bove problems, possible serendipitous 
effects are also noted and included in the list of problems. If the main effect of some action later in the plan is true before the action is executed, then that 
is noted as a possible place to shorten the plan (this is discussed in more detail in the next section). Second, only the last three problems above interact 
with the sol»*‘ : — » to the truth maintenance problem, since only they involve the truth-value of predicates in situations after the current execution point. 
The problem .*cogniter takes into account any changed deductions (see Section 4.2} while looking for the latter three problems. 
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4.3 Solution to th« truth miintmoci problem 

SIPra solution to the truth ntialniM problem k bnesd ou the efficiency of its deductive capability. Since It k aeaumed that pro e arn a* work m «xp«ckd 
wkatvir (hair pncoaditioi k tm uf ail pkaatom# whoa* MAIN STEP iloi points to the process art tnw, oily deduced tffoeU need to bo checked for 
thoir dependence oo unexpected effects. (Tbo execution moaitor will aolvo probkma having to do with precondition# and phantom* tbal art not tr»e). 

SIPE’a dad active capability waa daaig nod to find a good balance botwoon expressiveness and efficiency. Whik providing tbo power of many neofnl deductions, 
it novertbekaa keope dodnction nndor control by aeverely restricting tbo dedactioaa that can be made, aa well aa by having triggers to control tbe application 
of deductive operators. Alt deductiona that can be made are performed at tbe time a node k inserted into tbe plan. Since deduction k not expensive, tbe 
truth maintenance problem k solved aimpiy by redoing the deduction# at each node in the plan after an MN nod#. Even tbk can be avoided in simple cases, 
becacee SIPE earns# a liat of changed predicate# a# it goes through tbe plan and, if they all become true Uter in tbe plan (without any deduced effects 
changing in ♦ o nterim), then tbe execution monitor need eot look at tke remainder of the plan (either for redoing deductiona or for finding problems). 


5 Replanning Actions 

The eight replanning actions described below, itEINSTANTIATE, INSERT, INSERT-CONDITIONAL, RETRY, REDO, INSERT- PARALLEL, POP- 
REDO, and POP-REMOVE have all been implemented in SIPE. These action# provide sufficient power to alter plana in a way that often retains much 
of the original plan. These are domain-independent actions, and they form the bask of the general replanner. They should also prove uaeful as s bask 
for domain-specific error recovery operator#. Both of these uses are described in more detail in Section fi. The first seven actions can all be used to solve 
problems found by the problem recogniser, while the last k used to take full advantage of serendipitous effects. 

Four of the replanning actions change the plan so that it will contain unsolved problems. Tbs intention (see Figure 1) k that the plan will then Uter be 
given to the normal planning module of SIPE (poesibly after a number of these replanning actions have changed the pUn). The planner will then attempt 
to find a solution that solvee all the problem* that have been corrected in the plan. The planner automatically checks to determine whetker node# it splices 
into the middle of the plan cause problems later, so that any solution found will be correct. (It doe# this when copying nodes down to the next lower level 
during planning.) In all actions described below, the context argument merely apecifiee the context of the current plan. 


e REINSTANTIATE (predicate nod « context^ 

This action attempt! to instantiate a variable differently so as to make the given predicate true in the situation specified by the given node. Thk appears to 
be a commonly useful replanning action. For example, it might correspond to using a different reeource if something has gone wrong with ‘he one originally 
employed in the plsn, or deciding to return to tbe hopper for another screw rather than trying to find the one that has just been dropped. 

An attempted reinstantiation is dons by looping through the arguments of the given predicate. For each argument that U a planning variable (aa oppoeed 
to an actual ground instance), SIPE checks to see if there is another instantiation for it that will make the predicate true. Thk is cheap and efficient in 
SIPE, since it merely involves removing the INSTAN constraint on the variable from the current context (and also from all variables constrained to be the 
same as this one), and then calling the normal matcher (which will return possible instantiations) to determine if the predicate k now tm*. Note that all 
other constraints that have been accumulated on this variable are left intact, so only instantiations that meet all relevant requirements are found. 

If new instantiations are found, the REINSTANTIATE action checks the remainder of the plan to see if any parts of it might be affected by the new 
instantiation. This is done by a routine similar to the problem detector described in Section 4 (in fact, the two share much of their code). REINSTANTIATE 
currently accepts new instantiations only if »,h*y cause no new problems (see discussion below on trade-offs). If all new instantiations are rejected, the old 
INSTAN constraint is simply replaced. Note that replanning may be done Uter in the plan after the REINSTANTIATE action because of other problems 
that were found; the only requirement here k that the REINSTANTIATE action itself not introduce new problems Uter in the plan. 

One might use REINSTANTIATE to help with the above mentioned problem of dropping a screw in the following way. Suppose that SCREW 1 k a planning 
variable, while Si and S2 are particular screws. The plan being executed could have SCREW! instantiated to SI, a phantom to be maintained with the 
goal of (KNOWN- LOCATION SCREWl), and a PROCESS node for moving SCREWl to achieve (AT SCREWl WORKBENCH). During execution of the 
latter node, SIPE is told that the finger separation of the arm is **ro. from thk it could deduce (among other things) -'(KNOWN-LOCATION SCREWl) 
and -(AT SCREWl WORKBENCH). The problem of not achieving the purpose of the PROCESS node will result in an (AT SCREWl WORKBENCH) 
goal being inserted in the plan. Without REINSTANTIATE, thk would involve finding the location of Si and moving it to the workbench - which may be a 
very hard problem (as anyone who hae ever dropped a screw k aware). The problem of not maintaining the phantom node could trigger REINSTANTIATE 
on the predicate (KNOWN-LOCATION SCREW l), which wovld result in SCREWl being reinstantiated to S2 (whose location k known). This would 
introduce no new problems and SIPE could proceed to solve the (AT SCREWl WORKBENCH) goal by getting S2 from the hopper. 

To prevent the introduction of a large search space, REINSTANTIATE k limited by the requirement that it not introduce new problem#. Thera are 
also tradeoffs in deciding when to apply REINSTANTIATE as it exkts, but these are discussed later in the paper. The implementation described above 
opts for reinstantiation only when it is likely to be the correct solution. Thk is consktent with SIPE*s running efficiently on the problems it does solve. 
Alternatively, new instantiations could be accepted even though they caused problems - as long as the latter are less severe than the problem* incurred 
by keeping the old instantiation Since SIPE has no way of comparing the difficulty of two set* of problems, REINSTANTIATE does sot do thk. Doing 
so would introduce another very large search space into the replanning process. However, it would not be difficult to change SIPE to explore thk search 
space if a domain warranted it. There are also ways to partially lift this restriction at the coil of a moderately increased search space (though the tradeoff* 
involved probably depend *>n the domain). 

One could al*t> expend more effort in hnding new instantiations. As implemented, this replanning action will find reinstantiation* when only one variabl* 
is changed. Some problem* could be solved by remstantiating a whole set of variables, but this would be more expensive and involve a search problem to 
decide which variables to include in the set. The decision to try only one variable was made because it is efficient while evidently powerful enough to be 
useful. If the ability to re instantiate sets ->f variables appeared useful, implementing it would certainly be tractable. 


• INSERT (node l nodet ) 

This action inserts the subplan beginning with nooel (which has been constructed) into the current plan after node2. All links between the new subplan 
and the old plan are inserted correctly. This is used as a subroutine by many of the actions below. 


• INSERT-CONDITIONAL (variable node context } 

This action is not very interesting, but complements the unknown variable feature, which may be useful. It simply inserts a conditional around the given 
node that tests whether the given variable is known. If it is, the given node is executed next; otherwise a failure node is executed. 
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• RETRY (nodo) 

This replanning action is very simple. The five* node ia in«m«d to be a phantom nod* mad il is changed to a goal nod* so that the planner will perceive 
k a* unsolved. 


• REDO (pttditote mod* context) 

This action creates a GOAL nods whoss goal is tht given prsdicats. It thsa calls INSERT to place this nsw nods aftsr the givsa nods ia the plan. Ths 
planner will see the new node ae an unsolved goal. 


e INSERT* PARALLEL feeds pttdicotei context) 

This action essentially does a REDO on each predicate ia the list PREDICATES and pats the resaltiag GOAL nodes in parallel between a newly created 
SPLIT and JOIN. This tabplaa is inserted a/ter NODE ia the plan. The planner will see these new nodes as unsolved goals. This action is useful for 
re achieving parallel postconditions. 


e POP- REDO feeds predicate* context ) 

This and POP- REMOVE are the most complicated of the replanning actions; it ia used to remove a hierarchical wedge from the plan and replace it with 
a node at the lowest level POP-REDO is used when a PRECONDITION node is no longer true and another action must be applied at a higher level It 
could also be used to find higher-level goals from which to replan when there are widespread problems causing the replarning to fail (this is not cnrrently 
implemented). 

When redoing a precondition failure, it is easy tc determine the wedge to be removed, since PRECONDITION nodes are copied down from one level to 
another. The top of ths wedge to be removed is the node that was expanded to initially pi .e the given PRECONDITION node (or one of its ancestors 
that is a PRECONDITION node) in the plan. Actually, only the bottom of the wedge is spliced out of the plan, as planning will continue only from the 
lowest level. The subplan that ia removed at the luwest level is replaced by a copy of the GOAL or CHOICEPROCESS node that was at the top of the 
wedge. (The INSERT replanning action is used for this.) This ia seen as an unsolved goal by the planner, which automatically checks to ascertain whether 
expansions of this node cause problems later in the plan. 

Let us consider the example mentioned earlier of John planning to take a taxi to the airport when his car is broken. The operator for taking the taxi could 
have a precondition -*(HAS-CAR JOHN AUTOl) v (BROKEN AUTOl). (This will match John’s not having a car or his car being broken.) This operator 
is applied to solve the GOAL node (AT JOHN AIRPORT) at a high level in the plan, causing a PRECONDITION nod* for the above precondition to be 
inserted ini'- -ae plan and copied down to all lower level* of the plan. Suppose that, during execution, ->(BROKEN AUTOl) is entered as an unexpected 
effect dwmg execution of a process before the PRECONDITION node. This nod* is a future PRECONDITION which becomes false, aad the general 
replanner will apply POP-REDO to the problem. The wedge that is deleted has the COAL node (AT JOHN AIRPORT) at the top, This may be ft very 
large wedge if its lowest level is as detailed as "find the phone book, look up taxi in the yellow pages, dial a taxi company,* etc. At the lowest level, the 
whole plan of finding a taxi and taking it to the airport is spliced out and replaced by an (AT JOHN AIRPORT) GOAL node. When SIPE’s planner is 
later called on this plan, this GOAL node may be solved by John’s driving his car to the airport. 

There is one potentially serious complication in the above description of POP-REDO. Namely, various constraints may have been posted on the planning 
variables because of decisions made in the wedge of the plan that has been effectively removed. Fortunately, because of SIPE’s use of alternative contexts, 
this is easily solved. A context is a list of choice points, and constraint* are posted relative to the choice point that forced them to be poeted. Therefore, 
this problem is solved by removing from the current context all the choice points that occurred ia the wedge of the plan that was effectively removed. 
This new context is given as the context argument to future planning actions, and no further action need be taken. This resulU in ignoring precisely those 
constraints that should be ignored. 


e POP-REMOVE (node predicattt context) 

SIPE takes advantage of serendipitous effects to shorten a plan by using POP-REMOVE, which removes a wedge but does not insert a node. (It shares 
much of its code with POP-REDO.) However, in this case it is nontrivial to decide which wedge to remove. There may be various wedges that are candidates 
and, ae with REINSTANTIATE, these candidates may cause problems later in the plan if they are removed. SIPE currently handles this case in the same 
way it handles REINSTANTIATE. Namely, it removes a wedge, checks to see if this causes any problems, and, if there are any, replaces the wedge. Thus, 
serendipitous effects are exploited only if doing so does not change the rest of the plan. This is a trade-off like ths one discussed previously. SIPE again 
opts for efficiency, bu*. could easily be changed to explore the additional search space of replanning after the removal of wedges. 

SIPE also reduces ths search space by generating only one candidate wedge. It gives up taking advantage of the serendipitous effect if this wedge does 
not work. The candidate wedge is generated by following ancestor links from the node given to POP- REMOVE (which supposedly has a purpose that has 
become true serendipitoosly), as long as some main effect of the candidate node is made true by one of the predicates in the list of given predicates (that 
have unexpectedly become true). The candidate node found in this manner determines the candidate wedge. The wedge is rejected immediately unless all 
its main effects are true in the given list of predicates. 

Figure 4 uses the example of getting John to the airport to help illustrate this selection process. This example depicts a frequently occurring case in which 
the last action at one level of a wedge achieves the main effect of every level above that. For example, at Level I the goal is only to get John to the airport. 
At Level 2, after the choice has been made to take the taxi, the last node will achieve getting both John and the taxi to the airport. If Level 3 plans the 
mechanics of leaving the taxi, the last node there might contain all these higher-level effects ms well as the thinner state of John’s wallet. 

The above selection process requires that all goals generated at a higher level and achieved in the candidate wedge be achieved before the wedge becomes 
a candidate, while goals generated at a lower level than the top of the candidate wedge need not have been achieved serendipitously. Thus, for Wedge 2 
to be selected in Figure 4, the serendipitous effects must include (AT JOHN AIRPORT) from the higher level but need say nothing about how much caah 
John has sines that ia at a lower level. (It is assumed that, m long as ths highest-level goal is achieved, we do not care a*, ut the lower-level goals that 
were necessary to bring this about.) The main effects of higher-level nodes that are achieved within a candidate wedge are easily checked because they 
are copied down as effect* of the node that achieve* them. Thus, checking to verify that all main effects of ths candidate weegs axe true ensures that all 
important higher-level effects will be true. In the example as shown, Wedge 2 can never **■ . .ccted by this selection process because Wedge 1 will Work 
whenever Wedge 2 does. However, in another example the effects of Wedge 1 might be achi#.*«. . i Level Axfor* Wedge 2 so that Wedge 2 might then be 

\ 
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Figure 4: Hierarchical Wedge# with a Common Last Action 

6 Guiding the Replanning 

The replanning actions of the preceding section form the basis for SIPE’S general replanning capability and for a langnage capable of specifying domain- 
specific error recovery instructions that has Keen designed bnt not implemented. The latter could be thought of as instructions for guin.ng the search of 
the general replanner. This section describes the automatic replanner and briefly outlines the error recovery operators. 

0.1 The General Replanner 

The general replanner takee a list of probleme as welt ss poeeible serendipitous effects from ths problem recognissr, snd calls ons or mors of ths replanning 
action in an attempt to solve the problem. It first checks that a listed problem is still a problem, sines the REINSTANTIATE action may solve many 
problems at once. 

If the problem is a purpose that is not being achieved, ths system tries a REDO, which inserts the unachieved purpoee as a GOAL nods after the MOTHER- 
NATURE node. If the problem ie a previous phantom not being maintained, SIPE first tries REINSTANTIATE snd, if that fails, it calls RETRY. Ths 
idea is that, if there ia another object around with all the desired propertiee, it would be easier to use that object than to reachieve the desired state with 
the original object. If a PROCESS node has an unknown variable as an argument, INSERT-CONDITIONAL is called. If a future phantom ia no loafer 
true, RETRY is called. As with maintaining phantoms, REINSTANTIATE may be more appropriate, but, in both cases, this depends entirely on ths 
domain; thus the selection here is arbitrary. For preconditions that are not true, the general replanner first calls REINSTANTIATE and, if that fails, calls 
POP-REDO. If parallel postconditions are not true, the general replanncr calls INSERT-PARALLEL with the appropriate parallel goals. 

Whilt a general replanning capability ie a significant achievement, one cannot expect very impressive performance from a replanncr that does not have 
domain-specific information for dealing with errors. For example, whether or not REINSTANTIATE is Ukely to succeed will be dependent on ths domain. 
The automatic replanncr makes reasonable guesses at what might be a good choice in the domains on which SIPE has been tested. Since it merely chooses 
a replanning action for each type of problem that is found, it is very simple and could easily be rewritten for different domains. 

6.2 Error Recovery Language 

We also have designed an extension of the operator description language that enables instructions for handling foreseeable error* to bs included in operators. 
This will allow encoding of domain-specific knowledge for guiding the search of the general replanner (or even avoiding the search altogether). The error 
recovery operators will have the same syntax as all other SIPE operators, with soms new additions made to this languagt as described below. The plots of 
these operators will include references to the replanning actions in Section S. SIPE’s ability to specify conditional plans in operators can be used to try a 
second replanning action if the first one fails. 

Ths error recovery operators will mstdi their argument list to ths arguments of the node being executed so that original problem variables can be bound to 
the variables in the operators. There will be two ways to invoke these operators: one for general operators that solve problems that have been recognised, 
and one for more specific operators that act directly on unexpected predicates. Both are described below. 

The general operators will be applied after a MN node is added and problems havt been found by the problem recogniser, but before the general replanner 
is called (sec Figure 1). They will be applied to each of the problems in turn. Like deductive operators, error recovery operators will havt a TRIGGER 
slot {10} to determine when they should be applied. The trigger will be a a combination of keywords and predicates, with the keywords referring to 
the six types of problems. These triggers will match when their keyword matches the problem being tried and any predicate in the trigger matches the 
appropriate predicates given in the problem. These operators may also havt normal preconditions, which will be matched (in the normal manner) in the 
situation specified by the MN node. If any general error-recovery operator matches a given problem (i.e., both the trigger and precondition match), then 
the general replanner simply uses the instructions in the plot of the operator to choose the replanning actions to perform (rather than applying its own 
actions). Thus domain-specific guidance is supplied to the general replanner. It would be easy for such an operator, for example, to always force or prevent 
a REINSTANTIATE for a certain type of error under certain conditions. 

Specific error- recovery operators are apnlied directly to predicates as they are inputted to the execution monitor before the problem recogniser is called. 
This should not be expensive because only operators mentioned ia the ERROR slot of the action being executed are tried (see below). This ability seems 
attractive, since it can save a lot of effort when there is good domain-dependent error-recovery information available. When a specific operator matches 
an unexpected predicate, it may be possible in certain domains to simply apply the operator and assume that it will solve any problems caused by the 
unexpected predicate, thus circumventing the normal problem detection mechanism. If this option is chosen, SIPE simply assumes these error recovery 
operators are correct. Normal operation would involve checking for problems ss usual after the application of specific operators. 

Nodes in plots of regular operators will be able to specify an ERROR slot that gives names of specific error-recovery operators. When a node with an 
ERROR slot is being executed, the execution monitor will apply the operators listed in the error slot immediately to any unexpected predicate that is 
inputted during execution. Matching will be the same as for general operators, except that there arc no keywords in the trigger. If one of these operators 
matches, the replanning actions in its plot will be carried out immediately. There may or may not be an option to preclude further problem detection on a 
predicate that has been so matched. 
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Af mi example of the intended om of a specific error recovery operator, eouidir ill* problem of dropping a screw that REINSTANTIATE solved la lb* 
general repUnner. Suppose It is always desirable to return to tbs hopper after dropping a scmr during tbs procsss of transporting it. Tbs nods for moving 
OBJECT 1 to LOCATION 1 would have an sttot slot that Ustsd tbs DROP-SCREW opsrator. This specific error-recovery opsrator would match whenever 
tbs band soddsnly bscomss amply and OBJECTl is a scrsw. Tbs plot of tbb opsrator could than ipscify a REINSTANTIATE of tbs variable OBJECT 1 
(which will still b* constrainsd to b* a scrsw) followsd by an INSERT of a GOAL nods to b* acbisvsd (AT OBJECT! LOCATlONl). Uss of this opsrator 
could solve all problems of dropping a screw so that uss of both tbs problem rscognlssr and general rsplanner could be avoided. 


7 Examples 

This section presents two simple examples of SIPE monitoring tbs execution of a simple plan, then replanning when things do not go as expected. SIPE 
has been tested on larger and more complex problems than those presented her*. They involve a stands* d blocks world with ON and CLEAR predicates 
and a PUTON operator as described in more detail elsewhere (10|. The user inputs only what is sxplicitly mentioned in boldface below; everything else is 
generated automatically by the system. This Erst problem was constructsd to show tbs succsssful uss of the REINSTANTIATE replanning action, and the 
second shows how the system inserts a newly created eubplan during the replanning process. 

Figure 5 shows the initial world state and the original problem. The problem Is to get A on C in parallel with getting any blue block on any red block. In 
the initial world B1 and B2 are the only blue blocks (they are both on the table) and Rl and R2 are the only red blocks (R*. is on Bl and R2 is on ths 
table and clear). Since A and C are both dear initially, SIPE quickly finds a two-action plan of putting A on C In parallel with putting B2 on R2, as shown 
in Figure 6. 

This plan is then given to the execution monitor module of SIPE, which asks if P197 or P188 is to be executed first. The user types P19T and the system 
asks for unexpected effects. In this case the user types (ON D R2) followed by NIL to show ons unexpected effect, namely D has suddenly appeared on 
top of A2. This creates a MN nod* after P197 which also has the following effects deducsd by the system: -’(ON D TABLE) A •’(CLEAR R2). The problem 
recogniser is called and it finde only one problem, namely the PHANTOM node PI6S in the parallel branch wae being maintained but is no longer true. 
This is given to the general replanner which first tries REINSTANTIATE. This succteda as the OBJECTl vtriable in the PHANTOM node an be rebound 
to Rl without causing any new problems in the plan. The plan in Figure 7 is passed from the planning module back to the execution monitor (without 
showing phantom nodes and mainstep slots). P168 is then executed without any unexpected effects and the goal is achieved. Note that the original plan 
was retained in Us entirety and that B2 was placed on Rl instead of R2, thus achieving the original goal of getting A on C and any blue block on any red 
block. 

The second problem is the same as the firet, except that the variable REDBLOCKl is constrained not to be Rl (by specifying IS NOT Rl in the original 
problem). This will cause REINSTANTIATE to be attempted but fail, since R2 is the only other nu block. The original plan produced by SIPE is the 
same and the unexpected situation input by the user is the same. The problem recogniser again passe* the same problem to the general replanntr. This 
time SIPE tries REINSTANTIATE and fails, so it calls RETRY, which causes the (CLEAR R2) phantom in Figure 6 to be made into a goal. Tht planner 
solves this by producing a plan that puts D back on ths table before Bl it placed on R2. The subptan shown in Figure 8 replaces the (CLEAR R2) 
phantom node, P165, on the parallel branch before the PUTON B2 R2 node in Figure 6. Without unexpected events, the plan so constructed then executes 
correctly to achieve the original goal. Alternatively, more unexpected occurrences could be given during execution of the newly constructed plan and SIPE 
would again go through a similar loop of finding and fixing problems until the original goal is achieved. 


8 Comparison with Other Systems 

There is very little previous work in this area, since most domain-independent planning systems do not address the problem of replanning. Two that do 
(3,6) are discussed below. Tate’s NONUN (6j, and Vere’t DEVISER {9| do not concern themselves regarding execution. PLANXlO |lj lists “plan revision 
strategies” as an area for future work, but doee not appear to do replanning currently. McCalla and Schneider’s ELMER (5) has a module called the 
"executor* and claims to take an integrated view of planning and execution. The executor adds more detail to the plan by simulating execution. For 
example, secondary plans are added in parallel with the original plan (4) to provide a demon-like capability for handling certain situations that may arise. 
This is not replanning, but rather a more detailed level of planning, albeit with complex planning operations. The executor effectively produces complex, 
conditional plans (with possibly complex parallel interactions) for situations it forsees. It does not accept arbitrary input during execution and then replan 
by changing the original plan as SIPE does. In fact, the authors mention that ”to allow replanning after a plan goes awry* [5j is a future step in their 
research. 

Sussman’s HACKER [7| does modify plans (as do most planners that handle parallel actions), but does not deal with unexpected occurrences. HACKER 
produces plans that are not correct, then simulates them to detect errors. HACKER then soWes some of thee* errors by using a few simple actions, such as 
reordering parallel act ion ■ or reachieving subgoals that have not been maintained. Thus, the program is actually dealing with expected, not unexpected, 
occurrences. SIPE generates correct plans to begin with, then modifies them on the basis of arbitrary unexpected occurrences. What HACKER does with 
regard to plan modification is analogous to what the critica do in the standard planning module of SIPE. While some of the problems found by such critics 
are similar to those found by the problem detector in SIPE (e.g., previous phantoms not being protected), they are only a subset. SIPE also provides a 
richer set of replanning actions for modifying plans. 

The PLAN EX system at SRI International (2j was used to monitor the execution of STRIPS plans that were represented in triangle tables. PLANEX does 
not do replanning because it never changes the sequence of actions in the plan. However, it does allow for a weak version of the REINSTANTIATE action 
in SIPE where a variable can be reinstantiated and the same plan restarted. Without SIPE** ability to post constraints on variables, this is less useful. 
PLANEX uses the triangle table representation to determine the latest point in the plan where execution could begin in the current situation (unexpected 
or expected), including both the executed and unexecuted portions of the plan in this calculation. If unexpected occurrences create a situation in which 
restarting the plan from some point other than the current execution point would solve the problem, PLANEX would do this. (Note this may involve 
redoing previous actions or skipping actions that had been planned.) 

Although PLANEX can restart the original plan at a different point, this should not be construed as replanning. Moreover, it is not likely to be useful in a 
realistic domain. The world is not so benign as to frequently have unexpected occurrences produce situations in which ones’s original plan is still applicable 
exactly as is from some point. With very high-level examples (as in (2|), this may occasionally happen, but it will happen only rarely with detailed plans. 
For example, an action such as "pick up block B (wherever it may be)* can simply be repeated when B is accidentally dropped and its new location is 
unknown. However, if the robot must plan to go to the location of B before picking it up, the original plan will be applicable only in the unlikely event 
that B is accidentally dropped onto its original location. 

Hayes's system (3] and Sacerdoti's NOAH [6| have addressed the replanning problem. However, the approaches used in both these systems are considerably 
simpler and less powerful than that of SIPE. For example, NOAH does not allow the input of arbitrary predicates, so the general replanning problem never 
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Figure 5: Initial Block* World and Problem to be Solved 



Process: P197 
Action: PUTON.PRIM 
Effects: (ON A C) 
Dedace: (CLEAR B) 
-(ON A B) 
-(CLEAR C) 
Mainstep: PURPOSE 


Phantom: P162 
Goal: (CLEAR B2) 
Mainstep: P168 





Executed: P197 
Action: PUTON.PR1M 
EffecU: (ON A C) 
Dedace: (CLEAR B) 
-(ON A B) 
-(CLEAR C) 


Process: P188 
Action: PUTON.PRrM 
Effects: (ON B2 Rl) 
Deduce: -(CLEAR Rl) 
-(ON B2 TABLE) 



Figure 7: New Plan Produced for Continuing Execution 


Phantom: P440 
Goal: 

(CLEAR TABLE) 


Phantom: P437 
Goal: (CLEAR D) 


I Process: P443 I 

Action: PUTON.PRIM 
1 Effects: (ON D TABLE) 
Deduce: (CLEAR R2) 

I -(ON D R2) I 


Phantom: P450 
Goal: (CLEAR R2) 


Figure 8: Subplan for Replacing PHANTOM P185 
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irises. ti does permit ths UMr to specify that whoU wedges had been executed at once, and allows a nod# that has just been executed to be planned for 
again if it fails. This essentially provides one limited replanning action that is useful only in very specific situations. 

Hayes's system does allow the input of some information about unexpected situations. It is not clear what types of information can be provided, but they 
appear lees general than the arbitrary predicates accepted by S1PE. The system's only replanning action is to delete part of the plan. This permits the 
planner to reachieve higher-level goals, bnt they must be the tame higher-level goals that were already present in the plan. The system deletes everything 
that depended on any effect of a decision that is no longer valid. This will in general be wasteful, since much of the plan may be removed unnecessarily. If 
only one of the many effects of an action has failed, subplans depending on the unfailing effects do not nsed to bs deleted. SIPE would keep such subplane 
in the plan (and find any problems that may have been generated within them). 

SIPE provides a much more powerful replanning capability than either of these systems. It allows ths input of arbitrary predicates, computes ths extent 
to which these effect ths rest of tbs plan, only finds complications that are really problematical, and uses a large number of replanning actions (including 
REINSTANTIATE) to remedy problems In ways that enable much of the original plan to be maintained. 


9 Conclusion 
0.1 Summary 

Given correct information about unexpected events, SIPE is able to determine how this affects ths plan being executed. In msnv cases, it Is able to retain 
most of the original plan by making changes in it to avoid problems caused by these unexpected events. It is also capable of shortening the original plan 
when serendipitous events occur. It cannot solve di. r icult problems involving drastic changes to ths expected stats of the world, but it doss handle many 
types of small errors that may crop up frequently in a mobile robot domain. The execution-monitoring package does this without the necessity of planning 
in advance to check for such errors. 

The major contribution of thin work is the development of a general set of replanning actions that are used as the basis of an automatic rcplanncr, as well 
as the basis of a language for specifying domain-dependent error recovery information. These actions provids sufficient power to alter plans in a way that 
often retains much of the original plan, (e.g., the REINSTANTIATE action). The general rcplanner attempts to solve all problems that are found. It is 
unlikely to be very successful unlees it is adapted to particular domains. The design of the language for error recovery operators allows both for operators 
that will handle very specific situations and for those that will give more general sdvics to ths replanner. 

The success of these mechanisms can largely be attributed to taking advantage of the rich structure of StPE's planner and its plans. Often, the replanner 
calls the standard planning system as a subroutine. In this way it can taka advantage of the efficient frame-reasoning mechanisms in SIPE to discover 
problems and potential fixes quickly, applying its deductive capabilities to provids a reasonable solution to the truth maintenance problem. The fixes 
suggested by the replanner need involve only the insertion of new goals into ths plan, sines calling the planner as a sabroutins will solve these goals in 
a manner that asaures there will be no conflicts with the reet of the plan. SIPE’e execution- monitoring cnpnbilitiee make extensive use of the explicit 
representation of plan rationale. The problem detector makes usee of the information encoded in MA1NSTEP slots, phantoms, and precondition! to quickly 
find all the problems with a plan. Furthermore, it does not remove par's of the original plan unless the parts are actually problematical. The replanning 
actions make ass of constraints, alternntive contexts, and wedges in SIPE whsnever they consider removing part of the plan. 


0.2 Iatucj and Limitations 

From the beginning, the rationale behind SIPE has been to place enough limitations on ths represent stion so that planning can be done efficiently, while 
retaining enough power to itill be useful. Thie motivation underlies most of the design decisions that have been made in implementing the system, including 
the design of the replanning module. For example, REINSTANTIATE and POP-REMOVE are limited to prevent the exploration of large search spaces. 
The use of SIPE’s deductive capability to solve the truth maintenance problem also reflects our commitment to this design philosophy. Ths replanning 
capabilities have proved useful in two test domains. 

The major limitations of this research stem from the assumption of correct information about unexpected events. This avoids many difficult problems, 
the most important of which is generating the high-level predicates used by SIPE from information provided by the sensors. This appears to be the most 
critical issue in getting a high-level planner such as SIPE to control a mobile robot. Part of the problem is heuristic adequacy - the robot cannot wait ten 
minutes for a vision module to turn pixels into predicates while the world is changing. Other questions that have not been discussed here are deciding how 
much effort to expend checking facts that may be suspect, and modeling uncertain or unreliable sensors. Finding solutions to these problems is * f crucial 
importance to the task of endowing a mobile robot with execution-monitoring capabilities. 
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To control problem solving activity, a planner must rosolv^/unccrtainty about which specific 
long-term goals (solutions) to pursue anti about which sequences of actions will best achieve 
those goals. In rtm paper, we -drserthe a planner/that abstracts the problem solving 
state to recognize possible competing anti Compatible solutions anti to roughly predict the 
Importance and expense of developing these solutions. With this information, the planner 
plans sequences of problem solving activities that most efficiently resolve its uncertainty 
about which of the possible solutions to work toward. The planner only details actions for 
the near future bee a usd the results of these actions will influence how (and whether) a plan 
should be pursued. As problem solving proceeds, the planner adds new details to the plan 
incrementally, and monitors and repairs the plan to insure it achieves its goals whenever 
possible. Through cxpcriments r ^**-i!lustrate how these new mechanisms significantly 
improve problem solving decisions\ and reduce overall computation. Mfe briefly discuss 
mk currenCrescarch directions, including how these mechanisms can improve a problem 
solver’s real-time response and can enhance cooperation in a distributed problem solving 
network. / 




1. Introduction 


O 




A problem solver’s planning component must resolve control uncertainty stemming from two principal sources. As in typical 
planners, it must resolve uncertainty about which sequence of actions will satisfy its long-term goals. Moreover, whereas most 
planners are given a (possibly prioritized) set of well-defined long-term goals, a problem solver’s planner must often resolve 
uncertainty about the goals to achieve. For example, an interpretation problem solver that integrates large amounts of data into 
“good" overall interpretations must use its data to determine what specific long-term goals (overall interpretations) it should 
pursue. Because the set of possible interpretations may be intractably large, the problem solver uses the data to form promising 
partial interpretations and then extends these to converge on likely complete interpretations. The black board- based problem 
solving architecture developed in Hearsay-II permits such data-directcd problem solving |ll. 


In a purely data-directed problem solver, control decisions can be based only on the desirability of the expected immediate 
results of each action. The Hearsay-ll system developed an algorithm for measuring desirability of actions to better focus problem 
solving [2\. Extensions to the blackboard architecture unify data-directed and goal-directed control by representing possible 
extensions and refinements to partial solutions as explicit goals ,3|. Through goal processing and subgoals, sequences of related 
actions can be triggered to achieve important goals. Further modifications separate control knowledge and decisions from problem 
solving activities, permitting the choice of problem solving actions to be influenced by strategic considerations ’A]. However, none 
of these approaches develop and use a high-level view of the current problem solving situation so that the problem solver can 
recognize and work toward more specific long-term goals. 


In this paper, we introduce new mechanisms that allow a blackboard-based problem solver to form such a high-level view. By 
abstracting its state, the problem solver can recognize possible competing and compatible interpretations, and can use the abstract 
view of the data to roughly predict the importance and expense of developing potential partial solutions. These mechanisms are 
much more flexible and complex than those we previously developed 5j and allow the recognition of relationships between distant 
as well as nearby areas in the solution space. We also present new mechanisms that use the high-level view to form plans to 
achieve long-term goals. A plan represents specific actions for the near future and more general actions for the distant future. By 
forming detailed plans only for the near future, the problem solver does not waste time planning for situations that may never 

i This research was sponsored, in part, by the National Science Foundation under Grant MCS-8300327, by the National Science Foundation under Support 
and Maintenance Grant DCK-831877fi, by the National Science Foundation under CER Grant DCR-8500332, and by the Defense Advanced Research Projects 
Agency (DOD), monitored by the Office of Naval Research under Contract NR0I9-041. 
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arise; by sketching out the entire plan, details for the near-term can be based on a long-term view. As problem solving proceeds, 
the plan must be monitored (and repaired when necessary), and new actions for the near future are added incrementally. Thus, 
plan formation, monitoring, modification, and execution are interleaved |6,7,8,9,10|. 

Wc have implemented and evaluated our new mechanisms in a vehicle monitoring problem solver, where they augment previously 
developed control mechanisms. In the next section, we briefly describe the vehicle monitoring problem solver. Section 3 provides 
details about how a high-level view is formed as an abstraction hierarchy. The representation of a plan and the techniques to form 
and dynamically modify plans are presented in Section A. In Section 5, experimental results are discussed to illustrate the benefits 
and the costs of the new mechanisms. Finally, Section 6 recapitulates our approach and describes how the new mechanisms can 
improve real-time responsiveness and can lead to improved cooperation in a distributed problem solving network. 


2. A Vehicle Monitoring Problem Solver 

A vehicle monitoring problem solving node, as implemented in the Distributed Vehicle Monitoring Testbed (DVMT), applies 
simplified signal processing knowledge to acoustically sensed data in an attempt to identify, locate, and track patterns of vehicles 
moving through a two-dimensional space 11;. Each node has a black board- based problem solving architecture, with knowledge 
sources and levels of abstraction appropriate for vehicle monitoring. A knowledge source (KS) performs the basic problem solving 
tasks of extending and refining hypotheses (partial solutions). The architecture includes a goal blackboard and goal processing 
module, and through goal processing a node forms knowledge source instantiations (KSIs) that represent potential KS applications 
on specific hypotheses to satisfy certain goals. KSIs are prioritized based both on the estimated beliefs of the hypotheses each may 
produce and on the ratings of the goals each is expected to satisfy. The goal processing component also recognizes interactions 
between goals and adjusts their ratings appropriately; for example, subgoals of an important goal might have their ratings boosted. 
Goal processing can therefore alter KS! rankings to help focus the node’s problem solving actions on achieving the subgoals of 
important goals 3|. 

A hypothesis is characterized by one or more time- /orations (where the vehicle was at discrete sensed times), by an event-efass 
(classifying the frequency or vehicle type), by a belief (the confidence in the accuracy of the hypothesis), and by a blackboarddevet 
{depending on the amount of processing that has been done on the data). Synthesis KSs take one or more hypotheses at one 
blackboard-level and use event-class constraints to generate hypotheses at the next higher blackboard-level. Extension KSs take 
several hypotheses at a given black board- level and use constraints about allowable vehicle movements (maximum velocities and 
accelerations) to form hypotheses at the same blackboard-level that incorporate more time-locations. 

For example, in Figure I each blackboard-level is represented as a surface with spatial dimensions x and y. At blackboard- 
level s (signal level) there are 10 hypotheses, each incorporating a single time-location (the time is indicated for each). Two of 
these hypotheses have been synthesized to black board- level g (group level). In turn, those hypotheses have been synthesized to 
blackboard-level v (vehicle level) where an extension KS has connected them into a single track hypothesis, indicated graphically 
by connecting the two locations. Problem solving proceeds from this point by having the goal processing component form goals 
(and subgoals) to extend this track to time 3 and instantiating KSIs to achieve these goals. The highest rated pending KSl s 
then invoked and triggers the appropriate KS to execute. New hypotheses are posted on the blackboard, causing further goal 
processing ami the cycle repeats until an acceptable track incorporating data at each time is created. One of the potential solutions 
is indicated at black hoard -level u in Figure 1. 



ar** u airfares containing hypotheses (with associated sensed 

timesj Hypotheses at lusher blackboard-levels are synthesized from lower level data, and 
i potential solution i> illustrated wtth a dotted track at blackboard-level v. 


Figure 1: An Example Problem Solving State. 
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3. A High-level View for Planning and Control 


Planning about how to solve a problem often requires viewing the problem from a different perspective. For example, a chemist 
generally develops a plan for deriving a new compound not by entering a laboratory and envisioning possible sequences of actions 
but by representing the problem with symbols and using these symbols to hypothesise possible derivation paths. By transforming 
the problem into this representation, the chemist can more easily sketch out possible solutions and spot reactions that lead nowhere, 
thereby improving the decisions about the actions to take in the laboratory. 

A black board- based, vehicle monitoring problem solver requires the same capabilities. Transforming the node’s problem solving 
state into a suitable representation for r tanning requires domain knowledge to recognize relationships — in particular, long-term 
relationships — in the data. This transformation is accomplished by incrementally clustering data into increasingly abstract groups 
based on the attributes of the data: the hypotheses can be clustered based on one attribute, the resulting clusters can be further 
clustered based on another attribute, and so on. The transformed representation is thus a hierarchy of clusters where higher-level 
clusters abstract the information of lower-level clusters. More or less detailed views of the problem solving situation are found by 
accessing the appropriate level of this abstraction hierarchy, and clusters at the same level are linked by their relationships (such 
as having adjacent time frames or blackboard-levels, or corresponding to nearby spatial regions). 

We have implemented a set of know I edge- based clustering mechanisms for vehicle monitoring, each of which takes clusters 
at one level as input and forms output clusters at a new level. Each mechanism uses different domain-dependent relationships, 
including: 

• temporal relationships: the output cluster combines any input clusters that represent data in adjacent time frames and 
that are spatially near enough to satisfy simple constraints about how far a vehicle can travel in one time unit. 

• spatial relationships: the output cluster combines any input clusters that represent data for the same time frames and 
that are spatially near enough to represent sensor noise around a single vehicle. 

• blackboard-level relationships: the output cluster combines any input clusters that represent the same data it different 
blackboard-levels. 

• event-class relationships: the output cluster combines any input clusters that represent data corresponding to the same 
event-class (type of vehicle). 

• belief relationships: the output cluster combines any input clusters that represent data with similar beliefs. 

The abstraction hierarchy is formed by sequentially applying the clustering mechanisms. The order of application depends on the 
bias of the problem solver: since the order of clustering affects which relationships are most emphasized at the highest levels of the 
abstraction hierarchy, the problem solver should cluster to emphasize the relationships it expects to most significantly influence 
its control decisions. Issues in representing bias and modifying inappropriate bias are discussed elsewhere 12!. 

To illustrate clustering, consider the clustering sequence in Figure 2, which has been simplified by ignoring many cluster 
attributes such as event-classes, beliefs, volume of data, and amount of pending work; only a cluster s blackboard-levels (a cluster 
can incorporate more than one) and its time-regions (indicating a region rather than a specific location for a certain time) are 
discussed. Initially, the problem solving state is nearly identical to that in Figure 1 , except that for each hypothesis in Figure 1 
there are now two hypotheses at the same sensed time and slightly different locations. In Figure 2a, each cluster c* n (where / is 
the level in the abstraction hierarchy) corresponds to a single hypothesis, and the graphical representation of the clusters mirrors 
a representation of the hypotheses. By clustering based on blackboard-level, a second level of the abstraction hierarchy is formed 
with 19 clusters (Figure 2b). As is shown graphically, this clustering “collapses" the blackboard by combining clusters at the 
previous abstraction level that correspond to the same data at different blackboard-levels. In Figure 2c, clustering by spatial 
relationships forms 9 clusters. Clusters at the second abstraction level whose regions were close spatially for a given sensed time 
are combined into a single cluster. Finally, clustering by temporal relationships in Figure 2d combines any clusters at the third 
abstraction level that correspond to adjacent sensed times and whose regions satisfy weak vehicle velocity constraints. 

The highest level clusters, as illustrated in Figure 2d, indicate four rough estimates about potential solutions: a vehicle moving 
through regions /?, /? 2 . through RiR 2 R^R 4 /?(/?';, through R\R' 2 RzR 4 R$R i u or through R\ R 2 R i R 4 RlR^. The problem 

solver could use this view to improve its control decisions about what short-term actions to pursue. For example, this view allows 
the problem solver to recognize that all potential solutions pass through /? 3 at sensed time 3 and R 4 at sensed time 4. By boosting 
the ratings of KSIs in these regions, the problem solver can focus on building high-level results that are most likely to be part of 
any eventual solution. 

In some respects, the formation of the abstraction hierarchy is akin to a rough pass at solving the problem, as indeed it must be 
if it is to indicate where the possible solutions may lie. However, abstraction differs from problem solving because it ignores many 
important constraints needed to solve the problem. Forming the abstraction hierarchy is thus much less computationally expensive 
than problem solving, and results in a representation that is too inexact as a problem solution but is suitable for control. For 
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A sequence of clustering step* ore illustrated both with tables (left) and graphically (right), 
d represents cluster » at level i of the abstraction hierarchy. In (a), each cluster is a 
hypothesis. These are clustered by black board- level to get (b); note that graphically the 
levels have been collapsed into one. These clusters are then grouped by spatial relationships 
to form (c). which m turn is clustered by temporal relationships to form (d). 


Figure 2: An Example of Incremental Clustering. 


example, although thn high-level clusters in Figure 2d indicate that there are four potential solutions, three of these are actually 
impossible based on the more stringent constraints applied by the KSs. The high-level view afforded by the abstraction hierarchy 
therefore does not provide answers but only rough indications about the long-term promise of various areas of the solution apace, 
and this additional knowledge can be employed by the problem solver to make better control decisions as it chooses its next task. 

4, . Incremental Planning 

The planner further improves control decisions by intelligently ordering the problem solving actions. Even with the high- 
level view, uncertainty remains about whether each long-term goal can actually be achieved, about whether an action that might 
contribute to achieving a long-term goal will actually do so (since long-term goals are inexact), and about how to most economically 
form a desired result (since the same result can often be derived in different ways). The planner reduces control uncertainty in two 
ways. First, it orders the intermediate goals for achieving long-term goals so that the results of working on earlier intermediate 
goals can diminish the uncertainty about how (and whether) to work on later intermediate goals. Second, the planner forms a 
detailed sequence of steps to achieve the next intermediate goal: it determines the least costly way to form a result to satisfy the 
goal. The planner thus sketches out long-term intentions as sequences of intermediate goals, and forms detailed plans about the 
best way to achieve the next intermediate goal. 
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A long-term vehicle monitoring goal to generate a track constating of several time-locations can be reduced into a series of 
intermediate goals, where each intermediate goal represents a desire to extend the track satisfying the previous intermediate goal 
into a new time-location. 1 To determine an order for pursuing the possible intermediate goals, the planner currently uses three 
domain- independent heuristics: 

Heuristic- 1 Prtftr common intermediate foots. Some intermediate goals may be common to several long-term goals. If uncertain 
about which of these long-term goals to pursue, the planner can postpone its decision by working on common intermediate 
goals and then can use these results to better distinguish between the long-term goats. This heuristic is a variation of 
least-commitment jl3|. 

Heuristic-3 Prefer less costly intermediate goals. Some intermediate goals may be more costly to achieve than others. The 
planner can quickly estimate the relative coats of developing results in different areas by comparing their corresponding 
clusters at a high level of the abstraction hierarchy: the number of event-classes and the spatial range of the data in a cluster 
roughly indicates how many potentially competing hypotheses might have to be produced. This heuristic causes the planner 
to develop results more quickly. If these results are creditable they provide predictive information, otherwise the planner can 
abandon the plan after a minimum of effort. 

Heuristic-3 Prefer discriminative intermediate goals. When the planner must discriminate between possible long-term goals, it 
should prefer to work on intermediate goals that most effectively indicate the relative promise of each long-term goal. When 
no common intermediate goals remain, therefore, this heuristic triggers work in the areas where the long-term goals differ 
most. 

These heuristics are interdependent. For example, common intermediate goals may also he more costly, as in one of the experiments 
described in the next section. The relative influence of each heuristic can be modified parametrically. 

Having identified a sequence of intermediate goals to achieve one or more long-term goals, the planner can reduce its uncertainty 
about how to satisfy these intermediate goals by planning in more detail. If the planner possesses models of the KSs that roughly 
indicate both the costs of a particular action and the general characteristics of the output of that action (based on the characteristics 
of the input), then the planner can search for the best of the alternative ways to satisfy an intermediate goal. We have provided 
the planner for our vehicle monitoring problem solver with coarse KS models that allow it to make reasonable predictions about 
short sequences of actions to find the sequences that best achieve intermediate goals. 2 To reduce the effort spent on planning, the 
planner only forms detailed plans for the next intermediate goal: since the results of earlier intermediate goals influence decisions 
about how and whether to pursue subsequent intermediate goals, the planner avoids expending effort forming detailed plans that 
may never be used. 

Given the abstraction hierarchy in Figure 2, the planner recognizes that achieving each of the four long-term goals (Figure 2d) 
entails intermediate goals of tracking the vehicle through these regions. Influenced predominantly by Heuristic-l, the planner 
decides to initially work toward all four long-term goals at the same time by achieving their common intermediate goals. A 
detailed sequence of actions to drive the data in Ry at level .1 to level v is then formulated. The planner creates a plan whose 
attributes (and their values in this example) are: 

• the long-term goals the plan contributes to achieving (in the example, there are four): 

• the predicted, underspecified time-regions of the eventual solution 

(in the example, the time regions are (l R l orR* t )( 2 R 1 orR! l )(3 R z ) ... ); 

• the predicted vehicle type(s) of the eventual solution (in the example, there is only one type of vehicle considered): 

• the order ot : rmediate goals (in the example, begin with sensed time 3, then time I, and then work both backward to 

earlier times and forward to later times); 

• the blackboard-level for tracking, depending on the available knowledge sources (in the example, this is level t/); 

• a record of past actions, updated as actions are taken (initially empty); 

• a sequence of the specific actions to take in the short-term (in the example, the detailed plan is to drive data in region Ry 

at level .$ to level v): 

• a rating based on the number of long-term goals being worked on, the effort already invested in the plan, the average ratings 
of the KSIs corresponding to the detailed short-term actions, the average belief of the partial solutions previously formed by 
the plan, and the predicted beliefs of the partial solutions to be formed by the detailed activities. 


l ln general terms, an intermediate goal in any interpretation task is to process a new piece of .nformation and to integrate it into the current partial 
interpretation. 

If the predicted cost of satisfying an intermediate goal deviates substantially from the crude -stimate based on the abstract view, the ordering of the 
intermediate goals may need to be revised. 
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Aj each predicted action is consecutively pursued, the record of past actions is updated and the actual results of the action an 
compared with the general characteristics predicted by the planner. When these agree, the next action in the detailed short-term 
sequence is performed if there is one, otherwise the planner develops another detailed sequence for the next intermediate goal, b 
our example, after forming results in rt* at a high black board- level, the planner forms a sequence of actions tojdo the same m 
R4 When the actual and predicted results disagree (since the planner*! models of the KSs may be inaccurate), the planner mmt 
modify the plan by introducing additional actions thst can get the plan back on track. If no such actions exist, the plan is aborted 
and the next highest rated plan is pursued. If the planner exhausts its plans before forming s complete solution, it reforms the 
abstraction hierarchy (incorporating new information and/or clustering to stress different problem attributes) and attempts te 
find new plans. Throughout this paper, we assume for simplicity thst no important new information arrives slier the abstraction 
hierarchy is formed; when part of a more dynamic environment, the node will update its abstraction hierarchy and plans whenever 
such information becomes available. 

The planner thus generates, monitors, and revises plans, and interleaves these activities with plan execution, in our example, 
the common intermediate goals are eventually satisfied and a separate plan must be formed for each of the alternative ways to 
proceed After finding a partial track combining data from sensed times 3 and 4. the planner decides to extend this track backward 
to sensed time 2. The long-term goals indicate that work sho 1 Id be done in either R t or R\. A plan is generated for each of tkc 
two possibilities, and the more highly rated of these plans is followed. Note, however, that the partial track already developed 
can provide predictive information that, through goal processing, can increase the rating of work in one of these regions and not 
the other. In this case, constraints thst limit a vehicle s turning rate are used when goal processing (subgoaling) to increase the 
ratings of KSTs in R\, thus making the plan to work there next more highly rated.* 

The planner and goal processing thus work in tandem to improve problem solving performance. The goal processing uses s 
detailed vi*w of local interactions between hypotheses, goals, and KSIs to differentiate between alternative actions. Goal processing 
can be computationally wasteful, however, when it is invoked based on strictly local criteria. Without the knowledge of long-term 
reasons for building a hypothesis, the problem solver simply forms goals to extend and refine the hypothesis in all possible ways. 
These goals are further processed (subgoaled) if they are at certain blackboard-levels, again regardless of any long-term justification 
for doing so. With its long-term view, the planner can drastically reduce the amount of goal processing. As it pursues, monitors, 
and repairs plans, the planner identifies areas where goals and subgoals could improve its decisions and selectively invokes goal 
processing to form only those goals that it needs. As the experimental results in the next section indicate, providing the planner 
with the ability to control goal processing can dramatically reduce control overhead. 

In summary, we have developed mechanisms that permit incremental planning of problem solving activities in a blackboard- 
based problem solver. These mechanisms interleave planning and execution, monitoring plans and replanning when necessary We 
base these mechanisms on having a high-level, long-term view of problem solving and on having acceptable models of problem 
solving actions. Furthermore, note that incremental planning may be inappropriate in domains where details about actions in the 
distant future can highly constrain the options in the near future. In these domains, constraints must be used »o detail an entire 
plan before acting 13 . However, in unpredictable domains, incremental planning, plan monitoring, and plan repair are crucial to 
effective control since plans about the near future cannot depend on future states that may never arrive. 


5. Experiments in Incremental Planning 


We illustrate the advantages and the costs of our planner in several problem solving situations, shown in Figure 3. Situation 
A is the same as in Figure 2 except that each region only has one hypothesis. Also note that the data in the common regions is 
most weakly sensed. In situation B. no areas are common to all possible solutions, and issues in plan monitoring and repair are 
therefore stressed. Finally, situation C has many potential solutions, where each appears equally likely from a high-level view. 

When evaluating the new mechanisms, we consider two important factors: how well do they improve control decisions (reduce 
the number of incorrect decisions), and how much additional overhead do they introduce to achieve this improvement. Since each 
control decision causes the invocation of a KSI. the first factor is measured by counting KSIs invoked —the fewer the KSIs, the 
better the control decisions. The second factor is measured as the actual computation time (runtime) required by a node to sohe 
a problem, representing the combined costs of problem solving and control computation. 

The experimental results are summarized in Table I. To determine the effects of the new mechanisms, each problem situation 
was solved both with and without them, and for each case the number of KSIs and the computation time were measured. We also 
measured the number of goals generated during problem solving to illustrate how control overhead can be reduced by having the 
planner control the goal processing. 


3 In fact the turn to /?; exceeds ihes* *onstrainta, os does the turn to Ri tt so that the only track that latuies the constraints is R* { R * 2 RiR+Rs A». 
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Figure 1; The Experimental Problem Situation*. 


Experiments El and E2 illustrate how the new mechanisms can dramatically reduce both the number of K5U invoked and the 
computation time needed to wive the problem in situation A. Without these mechanisms (El), the problem solver begins with 
the most highly sensed data [d it dj, and dJJ. This incorrect data actually corresponds to noise and may have been formed 
due to sensor errors or echoes in the sensed area. The problem solver attempts to combine this data through d% and d* but fails 
because of turning constraints, and then it uses the results from d s and d 4 to eventually work its way back out to the moderately 
sensed correct data. With the new mechanisms (E2), problem solving begins at d$ and d* and, because the track formed (did*) 
triggers goal processing to stimulate work on the moderate data, the solution is found much more quickly (in fact, in optima/ time 
14 ). The planner controls goal processing to generate and process only those goals that further the plan; if goal processing is 
done independently of the planner (E3), the overhead of the planner coupled with the only slightly diminished goal processing 
overhead (the number of goals is only modestly reduced, comparing E3 with El) nullifies the computation time saved on actual 
problem solving. Moreover, because earlier, less constrained goals are subgoaled, control decisions deteriorate and more KSls must 
be invoked. 


The improvements in experiment E2 were due to the initial work done in the common areas 
Situation A' is identical to situation A except that areas d s and d 4 contain numerous competing 
works in those areas ( £5), then many KSls are required to develop all of these hypotheses— fewer 
at all (E4). However, by estimating the relative coats of the alternative intermediate goals, the 
d 4 , although twice as common as the other areas, are likely to be more than twice as costly 
Heuristic- 1, and a plan is formed to develop the other areas first and then use these results to 
and d 4 . The number of KSls and the computation time are thus reduced (E6). 


d) and d 4 triggered by Heuristic- 1. 
hypotheses. If the planner initially 
KSb are invoked without planning 
planner can determine that d a and 
to work on. Heuristic- 2 overrides 
more tightly control processing in 
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Situ: The problem situation. 

Plan?: Are the new planning mechanisms used? 

KSls: Number of KSls invoked to 6nd solution. 

Rtixoe: The total runtime (computation time} to find solution (in minutes) 

Coals: The number of goals formed and processed. 

Comments: Additional aspects of the experiment. 


Table I: A Summary of the Experimental Remit,. 
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In situation B, two solutions must be found, corresponding to two vehicles moving in parallel. Without the planner (E7), 
problem solving begins with the most strongly sensed data (the noise in the center of the area) and works outward from there. 
Only after many incorrect decisions to form short tracks that cannot be incorporated into longer solutions does the problem solver 
generate the two solutions. The high-level view of this situation, as provided by the abstraction hierarchy, allows the planner in 
experiment Efl to recognise six possible alternative solutions, four of which pass through dj (the most common area). The planner 
initially forms pian x . /Wor» t , and pfon* , beginning in dj, d s , and ^ respectively (Heuristic-1 triggers the preference for dj, and 
subsequently Heuristic-3 indicates a preference for d s and <f t ). Since it covers the most long-term goala, plan x it pursued first— a 
reasonable strategy because effort is expended on the solution path if the plan succeeds, and if the plan fails then the largest 
possible number of candidate solutions are eliminated. After developing «ij, ple«i is divided into two plans to combine this data 
with either d 3 or d* r One of these equally rated plans is chosen arbitrarily and forms the track djdJ, which then must be combined 
with d|. However, because of vehicle turning constraints, only d t d| rather than d|d 3 d? is formed. The plan monitor flags an error, 
an attempt to repair the plan fails, and the plan abort* Similarly, the plan to form d^d^dj eventually aborts. P(*n t » then 
invoked, and after developing d s it finds that d 3 has already been developed (by the first aborted plan). However, the plan monitor 
detects that the predicted result. d 3 di waa not formed, and the plan is repaired by inserting a new action that takes advantage of 
the previous formation of d t d| to generate d|dtd 3 . The prediction* are then more than satisfied, and the plan continues until a 
solution is formed The plan to form the other solution is s .* rly successfully completed. Finally, note once again that, if the 
planner does not control goal processing (E9), unnecessary overhead costs are incurred, although this time the control decision* 
(KSIs) are not degraded. 

Situation C also represents two vehicles moving in parallel, but this time they are closer and the data points are all equally 
well sensed. Without the new mechanisms (ElO), control decisions in this situation have little to go on: from a local perspective, 
one area looks as good as another. The problem solver thus develops the data points in parallel, then forms all tracks between 
pairs of points, then combines these into larger tracks, until finally it forma the two solution tracks. The planner usei the possible 
solutions from the abstraction hierarchy to focus on generating longer tracks sooner, and by monitoring its actions to extend its 
tracks, the planner more quickly recognises failed extensions and redirects processing toward more promising extensions. The new 
mechanisms thus improve control decisions (reduce the KSIs) without adding excessive computations! overhead (EU). However, 
the planner must consider 32 possible solutions in this case and does incur significant overhead. For complex situations, additional 
control mechanisms may be needed by the planner to more flexibly manage the large numbers of powibilities. 

6. The Implications of Abstraction and Planning 

We have described and evaluated mechanisms for improving control decisions in a black board- baaed vehicle monitoring problem 
solver. Our approach is to develop an abstract view of the current problem solving situation and to use chief view to better predict 
both the long-term significance and cost of alternative actions. By recognizing and planning to achieve long-term goals, problem 
solving is more focused. By using the abstraction hierarchy when making planning decisions, problem solving can be more coat 
effective. Finally, by interleaving plan generation, monitoring, and repair with plan execution, the mechanisms lead to more 
versatile planning, where actions to achieve the system's (problem solving) goals and actions to satisfy the planner’s needs (resolve 
its own uncertainty) are integrated into a single plan. 

This approach can be generally applied to black board -based problem solvers. Abstraction requires exploiting relationships in 
the data relationships that are used by the knowledge sources as well— such as allowable combinations of speech sounds J I. or 
how various errands are related spatially or temporally 4 .V Planning requires simple models of KSs, recognition of intermediate 
goals (to extend a phrase in speech, to add another errand to a plan), and heuristics to order the intermediate goals. We believe 
that many if not all black board -based problem solvers (and more generally, problem solvers whose long-term goals depend on their 
current situation I could incorporate similar abstraction and planning mechanisms to improve their control decisions. 

The benefits of this approach extend beyond the examples demonstrated in this paper. For example, goal satisfaction and 
problem solving termination are important issues in black board- based problem solvers. Given its underspecified goals of forming 
•good" solutions with its input, how does the problem solver recognize when it has found such solutions or when it can improve 
a solution? The more global view of the problem provided by the abstraction hierarchy helps the problem solver discover areas 
wherf improvements are possible and potentially worthwhile. The enumeration of possible solutions, and the success or failure to 
achieve them, similarly improves the problem solver s ability to determine when a solution is the best of the possible alternatives. 

These mechanisms also nelp a problem solver to make informed decisions about how best to solve a problem tnder real-time 
constraints. The KS models provide estimates of the coat (in time) of possible activities so that the amount of time to achieve the 
next intermediate goal can be predicted. By exploiting the similarities between intermediate goals, moreover, these predictions can 
be generalized over all intermediate goals (making allowances for more or less costly areas as indicated by the abstraction hierarchy) 
and the time needs for the entire plan can be predicted. With this prediction, the planner can modify the plan (eliminate actions 
that unnecessarily increase the belief in a hypothesis, replace expensive actions with actions that inexpensively achieve less exact 
results) until the predicted time costs satisfy the time constraints. 

4 In fact. th« WORD-SEQ knowledge source in the Hearsay- 1 1 speech understanding system essentially is 4 clustering mechanism: by applying mi 
grammatical constraint* about pairwise sequences of words, WORD-SEQ generated approximate word sequences solely to c ontroi 1 he application of the 
more expensive PARSE KS that applied full grammatical constraints about sequences of arbitrary length Jlj 
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Finally, planning and prediction are vital to cooperation among problem eoivert. A network of such problem solvers that 
are cooperatively solving • single prob l e m could communicate about their plans, indicating what partial solutions they expect 
to generate and when. With this information, each problem solver can coordinate iU activities with the others to generate 
and exchange eseful results more efficiently, thereby improving network problem solving performance |12,14,5|. In essence, the 
problem solvers together form a distributed plan. The use of incremental planning, plan monitoring, and plan repair is particularly 
appropriate in such domains due to the inherent unpredictability of future actions and interactions. 

The mechanisms we have outlined in this paper provide the basis for these poeeibilities. We are currently augmenting the 
mechanism with capabilities to perform in more complex, dynamic environments: to modify the abstraction hierarchy when 
important unexpected situations arise and to model and plan for potential future situations (the arrival of more data to be 
processed, the actions of other problem solvers). Our new mechanisms, though they address issues previously neglected, should 
be integrated with other control techniques to be fully flexible, as seen in experiment EU. The combination of our mechanisms 
aad goal processing has proved fruitful, and we believe that our mechanisms could similarly benefit by being integrated with 
other control approaches such as a blackboard architecture for control |4|. Based on the results we have outlined in this paper, 
we anticipate that the further development of mechanisms for developing abstract views and for incremental planning to control 
blackboard-based problem solvers will greatly enhance the performance of these problem solving systems, will lead to improved 
real-time response and to better coordination in distributed problem solving networks, and will increase our understanding of 
planning and action in highly uncertain domains. 
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1.0 iitrodwtloi 


/ k m * m m biy robots combine ths benefits of speed and accuracy with the capability of adaptation to changes in 
the work environment. However, an impediment to the use of robots is the complexity of the man-machine 
interface. * This interface can be improved by providing a means of using apr ior 1 knowledge and reasoning 
capabilities for controlling and monitoring the tasks performed by robots. 

Robots ought to be able to perform complex assembly tasks with the help of only supervisory guidance from 
him an operators. Por such supervisory guidance, it is important to express the commands in terms of the effects 
desired, rather than in terms of the motion the robot must undertake in order to achieve these effects. 

A suitable knowledge representation can facilitate the conversion of task level descriptions into explicit 
instructions to the robot. SUch a system would use symbolic relationships describing the apr lor i information 
about the robot, its environment, and the tasks specified by the operator to generate the comands for the robot. 
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However , the knowledge representation system should provide a scheme for building assembly models in an 
envlronaent that maintains the knowledge of the spatial and functional relationships among the engineering parts 
comprising an assembly. The system then prevents ths user from violating these relationships tnlees specifically 
asked to do so. Thus, the knowledge representation system has a memory for specified constraints and prevents 
actions whose side effects cause the violation of thase constraint* 

The modeling method provides an organisational structure thit maps the relationships between the components 
of an aawambly into a set of constraints. These constraints ar k simplified using a nwber of rules to determine 
the available degrees of freedom for a component. When a not relationship is to be established, the required 
motion is calculated and the feasibility of this motion Is determined by checking against the available degrees 
of freedom. 


2*0 ?*mk Level Progri 


lng 


Task level programming is an attempt to sake use 
that enables the use of apriorl knowledge to interpret 
oriented programming languages Include A OT CP ASS, ] 


structured programming together with an information base 
the commands Issued by the programmer. Examples of task- 
and LAMA. 


The processes are expressed in terms of the desired effects on the objects. Thus, a task is completed not 
when a manipulator has completed a series ot/ motions, but when the objects have reached the desired 
configuration. The knowledge representation systeja converts the task level specifications into manipulator level 
oommwids . 

AUTCPASS (1} accepts the steps of assemblies input from the user and with the help of a model of the world, 
oon verts these into a program that a Manipulator can process. However, the systme does not recognize changes in 
relationships occurring due to manipulator actions. The user is responsible for specifying physically realizable 
operations and also for informing the system/of changes in the attachment relationships. 

RAPT (£, developed at Edinburg), ha i concentrated on specifying the spatial relationships and reasoning 
about them. The relationships between objects such as "B1 against BB" and "Cl against C2", etc., are converted 
into algebraic equations. These equations can be stressed in terms of operational parameters, such as joint 
angles of the manipulator or in terms /of the degrees of freedom for the objects. The solution of these 
simultaneous non-linear equations is quirce complicated and time cons using. However, sene efforts at recognizing 
stereotypes and applying standard solutions have been successful. The maphasis here has been on representation 
of rel at ions hips i however, the geometrical representation Is at present incomplete. Hence, path planning, 
collision detection, etc., cannot be implemented using this system alone. 

LAMA (3) uses general program oflans that are expanded by details of Individual assemblies. Polyhedral 
representation for objects permits trajectory planning and collision detection. The constraints are expressed in 
terms of parameters that are vxispec/fied elements of partially filled transformation matrices. The constraints / 
are represented as oonstraint planes, which indicate the boundaries of permissible motion. Ranges of parameters Aj 
are calculated using volume interactions and constraint plane interactions. The assumption is made that //L 
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different constraints on an object are non- inter feeing. This asstsaption is not required in the system we 
propose, as the bodies always sows within the specified constraints and inconsistent constraints cannot be 
established in a constraint enforcing environment. 

LM {4} and Feature Descriptor (5) represent other examples of task level progressing. 

Fah lean (§ proposed a task planner for robot construction tasks in a blocks' world domain. The information 
contained in the object's motion was not used. The feasibility of the final state is determined from a stability 
viewpoint# but the motion from the initial to the final state la not considered. A task planner built in a 
knowledge base that considers the feasibility of object motion can be Integrated with a trajectory planning 

system. 

In the proposed knowledge representation system# a constraint enforcing environment is provided at the level 
of the database. This approach peraits supporting a task planner as well as interactive sessions with the 
programmer. This approach varies from the traditional approach by transferring some supervisory capability to 
the computer. The feasibility of motion in a constraint enforcing environment is determined by mapping 
relationships into a constraint set# simplifying the constraint# and interpreting the resulting constraints. 

3.1 G eom e trical Modal 

Homogeneous transformations are used to eipress the position and orientation information for objects. The 
position and orientation with respect to the world reference frame is stored with each object. Hence# this 
information In updated only when the object moves. The primitives# eg.# blocks and cylinders# are defined in 
terms of the centroid position and faces. Loci definitions are used as they are sufficient for the reasoning 
involved in constraint simplification as discussed later. 

An assembly is defined recursively as an object consisting of other objects. By Imposing the restriction 
that these primitives cannot move relative to one another# we can create rigid assemblies. We can represent 
mechanisms by permitting specified relative degrees of freedom for the primitives that combine to form the 
aaamably . 

4.0 Relationships 

The objects can be related to one another in a variety of ways with regard to relative aotion. These 
relationships are shown in Piqure 1. The contact relationships involve both concave and convex surfaces. A 
contact relationship between two convex surfaces is called "against*. The against relationship can involve a 
surface# edge# or a point contact between two bodies. A contact relationship between a concave and a convex 
surface is called a "fit". 


MOTION RELATIONSHIPS 





SURFACE . EDGE POINT CONTACT 


Figure li Motion Relationships 


4.1 The "Against" Relationship 

We will first define "against" in a restrictive sense and then generalize the definition. "Against" is a 
relationship established between two plane surfaces# such that translation is permitted in the common plane and 
rotation is permitted about an axis normal to this common plane. An example of against relationship is shown in 
Pigure 2. This "against” relationship exists between face PI of block Bl and face P2 of Block B2. 

This definition of "against" can then be extended to curved convex surfaces, where the contact area reduces 
from the plane to a line. The object with the curved surface retains any rotation capabilities that it had about 
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the body axis normal to tht curved cross-section, prior to establishing the "against” relationship. Similar 
definition applies for spherical surfaces where the contact reduces to a point. 


4*2 Am "Tit" Relationship 

Tit” Is a relationship established between a concave surface such as a hole and a convex surface such as a 
shaft. "Insert" is the driver function to c Demand the establisheent of a "fit” relationship. The hole and shaft 
can be of various cross-sections. The pre-condition for establishment of a "fit" relationship is that the 
profiles of the oonvex and the concave surfaces must match. 

Zn a "fit" relationship, the body containing the hole and the body containing the shaft are restrained to 
rotate about the hole-shaft axis if the profiles are circular. A translation along this axis is also permitted. 

Many special cases of "fit" can than be defined. A restricted "fit" relation ml$\t permit only rotational 
freedom. A rectangular key, on the other hand, is permitted translation but no rotation. 


Representation of other relationships such as threaded joints, etc., can be accomplished by uaing these 
basic relationships. Pcrmissable translation and rotation would no longer be Independent of each other, but 
would be related by parameters such as the thread pit<h. 


The contact relationships of "against" and "fit" and other functional relationships in the motion domain map 
into a set of motion constraints that represent the degrees of freedom for the objects. These constraints are 
represented in our aystem in terms of new primitives which are necessary for reasoning about motion. These new 
constraint relationships represent a general set into which all the relationships that pertain to motion can be 
mapped. An example of this mapping is shown in Figure 1. 



uses mcxriro rexatxon*ixr 


CONSTRAINT REPRESENTATION 


Figure 3: Mapping of Relationships to Constraints 


5.0 Drivers 

The driver functions that command the motion execution are of two types. The first type is the "move” 
faction. The "move" function is used when sane explicit motion is to be implemented by specifying the Mount of 
motion or by specifying the final destination. The second type of drive functions are those which oonaand the 
establishment of relationships. These functions determine the motion required to establish the relationship, 
find out if the motion la possible, and, if ao, then send out the necessary instructions to the robot to 
implement the motion. For the two contact relationships, "against" and "fit”, the driver functions are 
respectively called "establ is h- against" and "insert”. 

Determination of motion feasibility involves a number of steps. This is shown in Figure 4. The 
relationships of the body with other bodies, represented as constraints, are simplified to determine the 
translational and rotational degrees of freedom. If the entire desired notion is not possible with the available 
degrees of freedom, then an attempt is made to find one or more neighboring bodies such that thia set of bodies 
can together accomplish the remaining motion. This strategy is expl ai ned in Section 8. 

4.0 Representation of Motion Constraints 

We need a representation that is sufficient for the reasoning Involved in object manipulation, and is also 
simple and efficient to manipulate. We have departed from the conventional manner (Popplestone, Taylor, Mazer, 
etcj of inverting all the constraints into parametric equations and solving them over all the objects wder 
consideration . 

Let us define a few symbols first: 

T: represents a Translation degree of freedom 

R: represents a Rotation degree of freedom 

L: stands for a Line. This is used as a mnemonic for the axis of rotation or a line of translation. 

P: stands for a Plane. 
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Figure 4s Feasibility of Motion 

These mnwonlcs are combined in the following ways to give the constraints that are established by "establish- 
against” and "insert” operations: 

TPRs this code indicates that Translation is permitted in the Plane and Rotation is permitted about a 

normal to the Plane. The normal to the Plane must be specified to complete the description of the 
constraint . 

TLR« This aode indicates that Translation is permitted along a Line and Rotation is permitted about 

that Line. 

TL: This code indicates that Translation is permitted along a Line and no rotation is permitted. 

RL : This codes indicates that Rotation is permitted about the specified axis. 

The direction vector that is retired with all of these codes is called the constraint vector (CV) . Since 
the magnitude represented by this vector is insignificant, it is stored as a normalized vector. This direction 
is interpreted differently depending on the presence of P or L in the code. 

Besides the above constraint primitives, we must define a few additional constraint classes in order to 
implement a working set. These include three other constraints — "Stationary,” "Attached," and "ri«d" — and the 
unconstrained condition, "Free”. -These are clarified as follows: 

The "Stationary” constraint indicates that the object has a particular position and orientation, an<* these 
cannot change. 

Object A is said to be "Attached” to Object B when A and B always move together. In other words, A is 
rigidly connected to a Every object must store a list of all other objects that are "Attached" to it. When the 
move 001 m and la to be executed for an object, all the "Attached" objects should move together. 

As opposed to the "Stationary” and "Attached” constraints which are specified by the user, "Fixed" is a 
constraint derived from a set of user specified oonstraints. "rimed" represents the condition when a combination 
of constraints applied to an object results in no freedom of movement for the object when considered as a single 
entity. However, the implication is that the object could move if it were to move together with one of the 
constrained objects. This is clarified by the example shown in Figure 5. 

Consider the case when the following constraints are established: 

B1 AGAINST B2 ( F2 FI) 

B2 AGAINST B3 (F3 F4) 

The features involved are Indicated within parentheses. 

Under these two constraints, B2 can translate along a line in and out of the paper. Now, suppose we 
establish a third relation B2 AGAINST B4 (F5 75) in which B4 is behind B2. The result is that B2 is now "fixed" 
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Figure 5: B2 "Fixed* By Constraints 

and cannot move by itself. However, any of the combinations (B2,B4), (B2, B3), and <B2, Bl) can move along 
different directions. 

The constraints only describe the restrictions that the environment has on the object's motion. The driver 
functions, the "move" conn and , and the commands to establish relations use these constraints to determine whether 
the body can be moved or some other body needs to move along with the body prior to transmitting the ooamand to 
the robot. By using these constraints, we can build assemblies in a constraint enforcing environment where the 
user or the task planner is prevented from violating any previously specified constraints, unless the system is 
specifically asked to do so. 

In order to determine the available degrees of freedom, we will first show how to simplify the motion 
constraints in the next section. Following that, we will develop a method of determining the available Agrees 
of freedom from the specified constraints. 

7.0 Simplification of Constraints 

Seme rules for simplifying constraints are presented here. These apply to the case of rectangular blocks 
and cylinders. The extension to other bodies involving curved surfaces and to more general profiles Is posable. 

The rules may not represent a complete set, but are sufficient to demonstrate the f easibi 1 ity of the 
approach. Thie rules which follow are for a prototypical object B1 which is being commanded to move by some 
driver function si Rifles is mapped into); 

1. IF (TPR CV1) AND ( TPR CV 2) AND CV1 X CV2 i 0 
THEN (TLCV) AND CV - CVl X CVl 

Consider the motion for B1 in the following exsaples (Figure 6). 

32 A® INST B1 (F2,F1) (TPR CVl) 

B3 AGAINST 94 ( F4, F3) -*» (TPR CV2) 

Since CVl and CV2 are orthogonal to each other and lie in the plane. Bi can translate along a line in and out of 
the plane. 

2. IF (TPR CVl) AND (TPR CVl) AND CVl X CV2 - 0 
THEN (TPR CVl) CR (TPR CV2) 

Consider the motion for Bl in the following example (Figure 7a). 

92 AGAINST Bl ( F2, Fl) (TPR CVl) 

B3 AOUNST Bl (F4,F3) (TPR CV2) 
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Figure 61 B2 Agiirwt Bl (12, 11) 
B3 Against B1 <F4, P3) 



Figure 7a * B2 Against Bl (P2, Pi) 

B3 Against Bl (F4, F3) 

BL can still translate in the plane perpendicular to CVl and rotate about CV1. Using CVl or CV2 does not natter 
As shown in Figure Tb, CVl and CV2 can also be pointing in the sane direction. 
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Figure B2 Against Bl (F2, PI) 

B3 Against Bl (F4, F3) 

3. IF (TLR CVl) AND (TLR CV2) AND CVl X CNG 1 0 
THEN FIX 

Consider the motion for Bl in the following example for a "fit* relationship between a shaft and a hole (Figur* 

«) . 

B2 FITS Bl <S2, HI) -v- (TLR CVl) 

Bl FITS B3 (SI, H3) ^ (TLR CV2) 

Bl cannot move all by itself. However, Bl and H2 can move together as can Bl and B3. 

4. IF (TLR CVl) AND (TLR CV2) AND CVl X CV2 - 0 
THEN (TLR CVl) OR (TLR CV2) 

Consider the motion for Bl in the following exaaple (Figure 9), 

Bl FITS B2 (SI, K2) (TLR CVl) 

HI FITS B3 (SI, H3) (TLR CV2) 

Until now, we have been considering infinite lines and planes and, hence, no mention has been made regarding th 
depth of the hole and the length of the shaft. These parameters will be introduced later as factors necessar 
for determining the possibility of establishing relations retired for object manipil ation. 
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Figure 8 1 B2 Fits B1 (82, HI) 
BL Fits B3 (SI, H3) 



Figure 9 1 Bl Fits B2 (Si, K2) 
HI Fits B3 (SI, H3) 

5. IP (TPR CV1) AND (TLR CV2) AND CVl X CV2 • 0 
THEN (RL CVl) OR (RL CV2) 

Consider the motion for Bl In the following example (Figure 10) . 

B2 AOMNST Bl (F2,Fl) ( TPR CVl) 

Bl FITS B3 (SI, HQ -♦ (TLR CV2) 


CV2 



Figure 10: B2 Against Bl (T2 t FI) 

HI Fits B3 (SI, H3) 

Bl can rotate about CVl. B2 AGAINST Bl permits translation in e plane while BL FITS B3 permits translation along 
the normal to the same plane. The net result is that Bl cannot translate at all. 

6. IF ( TPR CVl) AND (TLR CV2) AND CVl X CV2 f 0 

THEN, IF CVl - CV2 - 0, THEN (TL CV2) , ELSE FIX 

Consider the motion for Bl in the following example (Figures 11a and lib). 
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Fiqure Ua: D1 Arjalnst I*?. { H , F2) 

m pitn in ( :;i, in > 



Fiqure 1 Hi t Ml Aqa I nst M2 (FI, F2) 

Ml Fits Ml ( M , HI) 

Ml AGAINST M2 <Fl,F2> (TVR CVl) 

ni fits M3 na, mi h»» (ti.k cv2) 

MI can translate a 1 onq CV2 an shown in Fiqure I la lor CVl 0V2 « 0. IkMnvor , an ntown in Fiqure lll>, when 
CVl CV2 / 0, Ml cannot move at ail. 

7. rP STATIONARY AND ANY ONE ttJNSTMA INT 
THEN STATIONARY 

If a body is stationary, it cannot move at all. 

8. IF FIX AND ANY 01® CONSTRAINT KXCtrT STATIONARY 
THEN FIX 

If a hody is constrained so that it cannot move by itself, then conntrni ni nq it further wit I not ohanqe the Fit 
constr ai nt . 

9. rF FREE AND ANY ONE CONSTRAINT C 
THEN C 

Free refers to an unaonstrai ned state and, hence, c is the only one constraint. 

The relationships of a tody with other todies are stored symbolical 1 y usinq the constraint, primitives 
discussed ahove. When motion feasibility is to be determined, the different re 1 at i onr; hi pn are first amplified 
usinq these rules. If a qroup of Indies moves together, then al 1 their relationships with other objects ar*- 
simplified in order to determine the feasibility of motion of the qroup as a whole. 

The simplification l r. compl eted when we have all the possible translational and rotat;sr.al deqree?; : 
freedom for the hody or set of todies. All the constraint vectors are converted into the world reference f rane 
before simplification. This approach is preferred to updating the constraint vector every time the position of 
the ohject changes, since the numher of updates required is reduced. 

The next section discusses the implementation strateqy for motion and f or the est-ibl istanent of relations. 

8.0 Strateqy for Irrpl*»ei»tinq Motion and for XstabI ishnent of ■elatiormhlpa 

Every pr i mi ti ve object has an ar hi tr ar i 1 y se 1 ected reference point. The position ol this point and t^» 
orientation of the tody a tout this point an measured in the world reference frame is cons i tor ed as the pr>s l r i r. 
of the object. The position and orientation of features are desert tod relative to this reference to 1 nt . 

bet us first consider the motion of a primitive object. When the mot i on of this tody to a par* i cu 1 >r 
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position and orientation is desired, the Motion Matrix is given by 
MOTION • ((XD-POSITION )" 1 <«W-PCBITION) 

, using the homogeneous tr ana format ion notation. This is represented by the dlaqran in Figure 12. 



OLD-P08ITION 

Figure 12: Calculation of Motion Matrix 


S.l Establishing Relations 

Driver functions such as "es tabl ish-aga i net* and "insert” are used to establish relations between objects. 
These commands are used to calculate the necessary notion matrix which is used to move the body in ocdci to 
establish a relationship. 

Consider the diagram shown in Plgure 13. Let us define the terms used in this diagram: 


P0SB1 : 
0LD-P0SB2 : 
NEW-PC6B2 : 


MOT I CM : 

FEAT1, FEATS: 
REL-ESTABLISH: 


position of object Bl in the world 
position of object B2 before moving 

position of object B2 after the relation has heen established. This in rodetermi ned at this 
stage . 

The motion regu ired of Cb ject B2 to establ ish the rel at ion. This is yet to he determ i ned. 
position of features on bodies Bl and B2 that are involved in this relation, 
a matrix that orients the constraint vectors to establish the relevant relation. 



0BJECT2 RELATION OBJECT! (FEAT2.FEAT1) 


Figure 13: Establishing Relations 

In the cas e of the "aga i ns t" re 1 at ions hi p, the REL-ESTAB LIS H mat r i x woo 1 d def i ne t he rotation needed to 
bring the normal vectors associated with the two faces in line with each other. In the case of the "fit" 
relationship, it is the matrix that brings the two axial vectors in line with each other. 

To determine REL -ESTABL IS H, consider the example in Figure 14. Consider two unit normal vectors Nl and N2, 
associated with the faces Fl and F2 of two objects Bl and (£. 


Nl 



TO MAKE Bl AGAINST B2(F1.F2) 

ROTATE B2 BY ACOStNl J42) ABOUT Nl X N2 


Figure 14: Definition of REL-ESTABLISH 

HI and N2 are two vectors to be aligned. A rotation of Nl by an angle *cos“* (HI fC) about the direction 
Hi x H2 will bring them into line with each other. This rotation is the one defined by REL -ESTABL ISH (Equation 
1). An additional translation is required to bring the two faces together. NEW~PQSB2 incorporates the combined 
rotation and translation of this new position for B2 (equation 2). MOTICN is calculated using this new position 
NEW-P0SB2 (Equation 3). 


0 
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RE L -ESTABLISH 


ROT COS' 1 { NL N2) , Ml XN2 


0 


0 

0 0 0 1 
[«W-PC6-B2] - (PC6B1) ( FEAT1) { RE L -ESTABLISH) (nEAT2)~ l 

[mCTTIOnJ - (0LD-PCB-B2)” 1 (1EW-PC6-B2) 

The following * loop equation* can be used at any tlaie to determine if the relation has been established, 
the equality is satisfied, then the relation has been established. 


U) 

( 2 ) 

(3) 

If 


^IEENTITyJ - (PC6B1) (fE ATI) (FEAT2)* 1 (PC6B2)* 1 { 4> 

The next task is then to find out if this motion is feasible in the presence of the already established 
constraints . 

However, there is one factor that still needs to be considered. The above-mentioned strateqy for 
establishing relations will work only if object Bl does not move while object B2 is being moved. If object B1 
moves in the process of achieving the required motion for object B2, then the relation is not yet established. 
This is explained by the following example (see Figure IS). 


rt .n 




Figure 15: Example of Unachievable Relationship 

The given constraint is Bl ATTAOED B2 (F3 F4) . We try to establish the relation EQ AGAINST Bl (FI FI). 

The motion needed by B2 to establish this relation is calculated, and in this case, it is simply a 
translation in the x direction. When this motion is attempted, it is found to be feasible and, hence, is 
implemented. Bit since Bl and GQ are "attached", Bl will a 1 so move and, hence, the desired r el ationship wi 1 1 not 
be established. In fact, for this example, the relation can never be established since the two bodies are 
attached and, hence, move by the same amomts. 

This problen is resolved by using an iterative procedure. Consider the diagram shown for the general case 
i n Fi gore 1 6. 

In the first attempt, motion M is the required movement of object B2 to establish the relation. However, in 
the process of planning motion M, motion Ml of object Bl also occurs. At the end of motion M, the "loop 
equation" (Station 4) is used to check if the relation has been established. If the relation is established, 
the new motion matrix will indicate zero translation and rotation. If the relation is not established, then an 
iteration is performed in which NP2 is redefined as OP2, Pl* is redefined as Pi, the new rel-es tatl ish matrix is 
given by FE\ and the motion matrix is recanputed. This is shorn as M2 in Figure 16. By canparing the motion M 
with motion M2 (as shown later), one can find if any progress has been made towards establishing the relation. 
If no progress is made as in the example in Figure 15, then we conclude that the new relation cannot be 
established in the presence of the al ready establ ished relation. 

In the above iterative approach, it is necessary to determine if progress has been achieved in bringing the 
two objects closer to each other. We assume that the iteration is successful if the new motion prediction is 
"smaller" than the "initial" motion prediction. A mechanism to compare an initial motion prediction with a new 
motion prediction is described in more detail in reference (7). 
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figure 16 1 
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old position of body 1 

new position of body 1 

notion of body 1 

feature of body 1 

feature of body 2 

old position of body 2 

new position of body 2 

new position desired foe body 2 

Motion of body 2 

notion nssdsd foe body 2 

Old REL -ESTABLISH 

new REL-CS7ABLZSH 


B.2 PsMsibility of Motion 

In order to decide whether s particular body notion is feasible or not , we need tc consider the constraints 
to which the body is subjected. We have shown how to simplify these different constraints in order to determine 
the posai bl e degrees of freedom. In order to facilitate the process of determining motion feasibll ity# tbs 
translation and rotation are each handled separately. 

If, after simplification has been performed, the constraint code includes TU then a part of the translation 
needed can be accomplished by moving the body along the direction given by the constraint vector. If the 
constraint aode includes TP, then a part of the motion is achieved by translating in the specified plane. If the 
entire desired translation is not possible, then one cannot achieve the entire motion by moving this object 
alone. One must then consider moving a neighboring object along with the body of interest. This strategy is 
described later. 

The rotation part of a given motion matrix is converted into an equivalent angla about an axis locatad in 
the world reference frame. The constraint vector associated with the rotation dagraa of freadom is then compared 
with this equivalent axis. If the two coincide, then the rotation is possibles otherwise, the rotation is not 
possible by this body alone. 

If the body is foind to be constrained from moving when considered alone, then the following strategy for 
considering cooperative motion with neighboring objects is pursued. A search similar to e breadth first search 
is used. Thi s is best ejqplained by an example (see f'i^ire 17). 
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Figure 17: Cooperative Motion 

Consider an attempt to move the object Bl. aippoae the following relations alraady axist: 


in 



•1 I* in contact with 12 and SI* 

12 Is In aontsct with B4 and 15. 

9m first attempt Is to find If Bl can bs moved fay ths r«silr«d asoist all a Iona. If this Is not possible, 
than an attaint is nada to oonsldar tha notion of Bl and B2 together. in tha constraint simpl If ication that la 
par fora ad, tha constraints between tt and B ara not oonsldar ad in this ossa as 11 and B do not havw relative 
notion* 

If 11 and B cannot accomplish tha aatira remaining notion together, than B and B ara oonsldar ad together. 
If tha antlra notion is still not implemented, than 11, B2, and 11 ara oonsldarad together. Tha saarch ordar 
us ad is (11), (11,12) , (11,13), (11,12,13), (11,12,14), (11,12,15), (ll,B,13,M), (11, 12, 13, 15) , AND 
(11, 12,13, 14,15). 

Tha sat (11,12,13) is oonsldarad bafora (11,12,14) sinoa 12 and 13 ara in diract contact with 11 whidi Is 
tha objact of Intarast. Tha governing tula in this saarch is to always attsnpt to nova a nininun nun bar of 
bodies prior to attempting a larger sat. 

At each sta 9 a in tha saarch, notion Is carried out as far as possible. If there is still sons notion 
inquired, than tha saarch continues. As a result, tha total desired notion for 11 night be achieved partly ty Bl 
alone and portly fay fit and B together. 

If a constraint "Stationary" is encountered in tha search, tha search along tha path is immediately 
terninated. Note that all tha attached bodies ara always considered together in this notion feasibility 
psradi^i. 

the structure is described as a tree diagram. However, ths general istd version can be a graph (sea rigors 
IBS). This graph can bs oonverted into s tree by repeating appropriate node# (see Figure 1S>). 
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(s) (b) 

Figure IB: Conversion of Graph to s Tree 

9.3 Motion of Assemblies 

To determine the motion poss i bi 1 i ti as for an assembly, we have to treat them as an "attached" group of 
primitives. Ne store the position and orientation information for each primitive. When the assembly has to 
move, all ths primitives must move by ths same amount. 

1.4 Range of Motion 

In our discussion so far. we have considered the constraints as basically degrees of freedom. For instance, 
TPR indicates that translation is allowed in a plane and rotation is permitted about the normal to the plane. 
However, theae motions are not Infinite. These motions are limited by the finite dimensions of the bodies 
involved In the relationship. 

*9. 0 ttplmnamtation 

Ths knowledge representation system described here has been implemented using ths FLAVCK system on the LW 
LISP Machine. A typical data network is shown in Figure 19. In the constraints discussed above, we did not 
Include the range of motion. Geometric algorithms to detect col 1 ision of objects and over lapping of faces, 
overlapping of two holes, etc., are implmaented as tests to bs conducted before sending any motion command to the 
robot. This scheme provides for expandability of the system to incorporate checks such as tolerance matching, 
surface finish matching, checking types of fit, calculating inertia properties, etc. Some examples can be fowd 
in reference ( Jj . 

The system described thus far is s constraint enforcing system. The breaking or making of relationships as 
s side effect of motion is not considered. This consideration of dynamic constraint propagation requires a 
constraint network on a global scale which was indeed designed end lmplamsnted (7), but is out si da ths a CD pi of 
this paper. 
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Tht rtctmgl« indicate data 
objects. The ovals indicata 
Information slots. Ovals 
connecting two data objects 
indicata inforaatlon slots In 
both data objacts. 


Pigure 19b: Data Network for Nodal Shown in PlgurS 19S 


Not a: This is not a ooaplata natwork as soaa rapatitiva structures are not shown. 


18.0 Conclusion 
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The applications of tfels knowledge representation aystem/Jar* sanifold. The programming of robots using 
higher level constructs hides the robot specific progressing details frca the user by subaerging than into the 
doaain specific knowledge base. The constraint enforcing environment provides for an on-line debugging facility 
which operates at the conceptual task level rather than at the programs syntax level. Before sending a c owe and 
to a robot , the conn and is slaul ated in the world mode 1 , and this provides for s real tiae error pr event 1 ve 
capability, with the addition of dynamic constraint propagation, the knowledge representation syatmi becomes 
capable of simulating simple assembly operations, this providts s basis for the development of a task planner. 
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Real Artificial Intelligence for Real Problems 

R.S. Doshi 

Jet Propulsion Laboratoiy 
California Institute of Technology 
Pasadena, CA 91109 

Thl9 Is a aumiary of a panel discussion that Mae held at thepon^lusion of the 
sessions on artificial Intelligence (AI). P anel wwn t ery i rt srs:- J ? . Chewrrmwn , 
N ASA toes: S. Harmc n.-Robot Intel ligence Interrw^or^^ ^ Haywerd/ McGill 
University; K, Kespf , FMCf\J. Gppenhelm, C^^^^ Sanderecn, CMU; andD. 
MUklnsu_JSRI . 


The following queeticne were dlsc u s s cjpfry-'thetenel ; 1) What other re s e a rch 
areas need to he addressed to extend yc^~Siork7^__2)Tn your re s e a rch 
experience, have you found It fruitful to choose appliSrtton~-dcc!alra frcm 
reel -world scenarios? 3) The t^lerobot progr a m needs to merge task planning 
and spatial reasoning. Wtmr do you think need s to be done to achieve this 
merger? 4) Is the blocks world really slmple/trivlal? Responses and 
discussion on these lssgds are surnnarlzed 4 bsdfi* . 

The blocks world dorfain is definitely not siwple. Mwiy AI planning systems 
have solved many .swpects of the toy-blocks world. Hewever, the real-blocks 
world Is largely unsolved. The reel-blocks world problem includes the 
following subproblems: 1) Representing geometries of the workcell, the 

robot (s) , the objects, assemblies of objects; 2) Practical techniques for 
efficiently reasoning with these representations; 3) Developing mathematical 
and symbolic models of sensor hardware and Interpretation of s ensor behavior; 
4) Developing mathematical and symbolic models of external agents like 
friction, dynamics, kinematics and wear and tear. Although these areas are 
undergoing active rese a rch, results are far frcm complete. 


Nowadays, it is inperative to choose drmalns frcm real-world scenarios. This 
helps make for good demonstrations. However, to actually make it realistic, 
the application must also include one or more (to a certain manageable degree) 
subproblems of the real-blocks world (disc uss e d above) . An example of such a 
realistic problem is to investigate an AI task planner which also reason s 
about the (numeric) geometric representations and plans actions based on real, 
actual, existing robot hardware. Even if the AI task planner is sinple, 
solution of the combined problem adds a lot to the state-of-the-art. 


One of the important unsolved problems is that of integration of diverse 
technologies in the fields of mathematics, computer science, cognitive 
science, artificial intelligence, mechanical engineering, robotics, electrical 
engineering, etc. 

Integration of AI with non-AI systems is something which Is talked about a lot 
and postponed. Many theoretical suggestions do exist. The general feeling 
seems to be that the integration will not be a batch-mode Integration ("do- 
thls — t hen-do- that" ) • It will be a multi-contact, heavily coupled 
interaction. The inportant thing is to start integrating and failing. Fail 
erough, and often enough, so enough can be learned. 


IW|H5 


if*f mwMsimiwari y I feftM 


TRAJECTORY PLANNING FOR MANIPULATORS 
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The Use of 3-D Sensing Techniques for On-Line 
Collision-Free Path Planning 

V. Hayward, S. Aubry, and Z. Jasiukajc 
McGill University 

Montreal, Quebec, Canada H3A2A7 
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Abstract 

pnj i#r ike. Stott of tht art in collision prevention for manipula- 

tors with revolute joints, showing that it is a particularly computationally hard 
problem } Based on the analogy with other hard or undecidable problems such as 
Keortm proving , t O^pmpsm^ an extensible multi- resolution architecture for path 
planning, based on a collection of weak methods^F inally, ws s aamitee the role 
that sensors can play for an on-line use of sensor data* 1 
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1. Introduction 

Automated collision prevention for robot manipulators is an 
essential feature seldom available, even in its simplest forms, 
in today’s robotic systems. The term “collision prevention" 
will refer to collision detection and collision-free path plan- 
ning. This paper presents current issues in collision preven- 
tion for manipulator robots with revolute joints in relation 
to the use of sensors. 

Given the difficulty of the problem, only partial solu- 
tions have been found. Furthermore, we believe that it is 
particularly constraining to have to assume that robotic sys- 
tems will operate in perfectly pre-m odeled environments, as 
do all the currently existing industrial systems. As a conse- 
quence of the “perfect model" approach, many largely un- 
solved issues immediately arise: uncertainty representation 
and assessment, time-varying environments, computational 
and storage complexity. 

We will rather advocate an “imperfect model" approach 
that will lead us to the consideration of a multi-level sys- 
tem heavily relying on sensory information and using multi- 
resolution algorithms. Instead of elaborating a theoretically 
exact method, and then deriving the computational and stor- 
age complexity in order to design a computer system to im- 
plement it, we will rather present methods that only par- 
tially solve the problem, but whose performance improve as 
the computing power is increased. Studying the problem of 
collision path-finding in connection with the available sens- 
ing techniques also provides some insight into the solution. 

2. Current Methodologies for path planning 

Experimental research in path-planning in the context of 
mobile robots has been fo far more successful because of the 
reduced complexity of the two-dimensional case. However, 
many concepts developed in the two-dimensional case do not 
extend readily to the three-dimensional case. For example, 
the conceptually attractive “configuration space approach" 
fails to extend easily to the case of manipulators with rev- 


olute joints, whereas it applies very nicely to the case of 
mobile robots. The reason is that the Cartesian space, in 
which obstacles — or free-space — are described, maps very 
awkwardly into the configuration space. In case of a manipu- 
lator, the configuration space is equivalent to the joint space. 
The mapping is highly non-linear and occurs between spaces 
of different dimensionalities. The computational complexity 
becomes unmanageable beyond three joints for the problem 
of mapping the Cartesian space scene into joint space as well 
as for searching the resulting graph. The approach is limited 
to three joints (Lozano 1986, Gouzene 1984) even if recursive 
decomposition schemes are utilized (Faverjon 1985). 

Another widely adopted approach (Khatib 1986) is of lo- 
cal nature and consists of the computation of a artificial field 
of potential increasing near obstacles, and globally decreas- 
ing toward a goal position. Pseudo-forces are then included 
in the low-level motion servo computations of the manipu- 
lator. As a result, the manipulator is controlled to move 
away from obstacles and toward the goal position. Unfortu- 
nately, the method breaks down in obstacle configurations 
that create local minima. Computational complexity also 
precludes attempts to enlarge the scope of this method. The 
great attraction of this idea is the possibility to use sensor 
data directly for the computation of the potential instead 
of an a priori model. Similar schemes can be formulated 
in kinematic terms, that is in terms of velocities, instead of 
forces. The problem of local minima can be largely elimi- 
nated by the use of redundant manipulators: the trajectory 
of the end-effector can be complete!) specified and the re- 
dundant linkage?* used to avoid obstacles u?*ing onl> local 
information (Maciejcwski 1985). Operation Research has 
been also considered helpful to attack the complexity prob- 
lem (Grechanovsky 1983). 

The methods of the last categor) resort to limit the 
class of tasks being planed. For example, the range of mo- 
tions can be restricted to pick and place operations (Brooks 
19836). Similarly. the range of obstacles can be restricted, 
for example to pillars shapes {Luh 1984). 
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3. Proposed Ideas 

We would like to suggest a few new ideas to the problem 
of collision prevention, in the view of their use in an on- 
line control system. We mean by that, trajectory generation 
techniques that allow the computation of collision-free tra- 
jectories in the same amount of time as require by the mo- 
tion of general purpose robots, using limited computational 
resources. At this point, we would like to draw an analogy 
with the problem of theorem proving in computational logic. 
This problem is undecidable, that is to say, if the proposition 
submitted to the system is in fact false, the result cannot be 
obtained in a finite number of steps. 

The search space to explore in order to prove a propo- 
sition can become arbitrarily large. If the proposition under 
study in indeed provable, efficient methods in theorem prov- 
ing attempt to use powerful heuristics to reduce the search 
space. These heuristic methods are called “weak methods” 
because they do not guarantee success, but are likely to con- 
verge in most interesting cases. If the proposition is false, 
this conclusion may not be reachable in a finite number of 
steps, and one has to resort to cut the search at some arbi- 
trary point and to assume that the proposition probably is 
false. 

In path planning, there are many heuristics available, 
and we suppose a limited amount of available computations, 
hence the architecture described below, based on a collection 
of weak methods. 

3.1. Computational Architecture 

We require the system to be extensible in the sense defined 
by Brooks (Brooks 1986). As researchers are devising new 
methods to calculate collision free trajectories, we would like 
to be able to integrate these advances while causing a min- 
imum of disturbances to existing and working parts of the 
system. 

The following diagram illustrates the design concept of 
an extensible architecture. The question of whether each of 
the methods will reside on one or several processors is of lit- 
tle importance. What is is important is to design them as 
peers such as they can accept the same input and produce 
the same output formats. The crucial point is not to attempt 
to parallelize the computations of one particular method, be- 
cause we know that many of them require exponential times 
to execute, but to parallelize the methods between them so 
that we obtain a natural selection of the most appropriate 
for the situation at hand. 


The computations should be done at a least two levels. The 
top-level searches for collision-free trajectories, using any of 
the methods available. Input to the high-level is data ob- 
tained from global sensor measurements as discussed in the 
section “Sensors,” or from pre-determ ined models obtained 
from data-bases. The lowest level is a local collision detector 
that also uses either sensor information or pre-determ ined 
model information. We will require an efficient collision 
detector to certify the proposed trajectories, or use on-line 
proximity sensor mounted on the arm to locally modify the 
trajectories as they are executed. In the later case, we take 
the chance that the motion may never terminate. 

3.2 A Variety of Heuristics: Archetypical Motions 

The study of robot motions shows that given an approxi- 
mate description of a robot's environment, and given the ini- 
tial and final configurations, robot motions can be classified 
into classes of archetypical motions. A preliminary analysis 
shows that collision-free motions bear a strong relationship 
with the structure of the workspace. This relationships can 
be exploited to built a system that infers plausible motions. 
In this framework, the problems under study are: 

- Classification of obstacles according to the influence 
that their shape might have on the motions: small, 
large (with respect to the robot), compact, elongated, 
fiat, etc... Interesting simple cases are: spheres, infinite 
cylinders, half spaces, and holes. 

- Classification of the relations between these obstacles 
with respect to the robot elements: proximity, position 
with respect to the elbow, under, above, on the left, on 
the right, etc... Inference will occur on criteria of this 
kind. 

- Classification of typical motions: retraction, extension, 
sweep, wrist re-orientation. The consequences of these 
motions are explicit: if joint No 1 turns in the positive 
direction and the arm is stretched, then end-effector 
sweeps on the left; if the arm retracts and is in the elbow 
up configuration, then the end-effector moves inward, 
and the elbow moves upward, etc... 

Once the scene and the robot attitude are encoded in 
terms of facts and rules, motions can be generated by auto- 
matic inference. 
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3.3 Joist Decoupling 

Joint decoupling is another way to attack the problem. The 
observation of certain collision avoiding motions reveals that 
motion planning can be performed by planning the motions 
of the joints independently. During such a motion, in the 
reference frame attached to esch link of a robot, all the ob- 
stacles appear as moving obstacles. The task consists for 
each link to plan a one-dimensional trajectory in its own co- 
ordinate frame with a time-varying environment. We know 
from (Kant 1984 and 1986) that such a planning is possible, 
by planning the velocity along a predetermined path. This 
algorithm finds solutions in a large number of cases, when 
the priority among the set of joints is adequately determined. 
The problem is formaly equivalent to moving multiple ob- 
jects as in (Erdman 1986). 

3.4 Piece-Wise Trajectory Decomposition 
Another heuristic method can be described as follows. If 
the arm is to move from point A to point B, a trajectory is 
generated at the first iteration using a very simple scheme: 
a linear joint interpolated motion between A and 0, for ex- 
ample. The trajectory is then verified. In case of collision, 
an in ter mediate knot point is generated by the closest non 
interfering position. The initial segment is then split and 
the process recursively iterated on the sub-segments. 

3.5 The generate- test- refine architecture 

We have just listed three powerful heuristics to reduce the 
search space of the problem. There exist others. We can 
augment the power of these heuristics by feeding back to a 
motion planner information provided by the collision detec- 
tor in case of the failure of a plan, or information provided 
by a merit estimator, in case of success. The system is left 
interating during the allocated time period, the last best 
solution being retained. 

3.5 Good Collision Detectors 

Of what precedes, we require a good quality collision de- 
tector, that is to say, one that does not require exponential 
nor polynomial times to perform and one that uses multi- 
resolution algorithms. This problem has been examined in 
(Hayward 1986), One approach is to perform the modeling 
the robot in terms of contro* points scattered on its bound- 
ary. Collision detection can be then performed by showing 
that all the control points are in free-space. (note that there 
is no need to worry about rotations). A multi-resolution 
system can then be easily obtained. 

The quality of the result augments with the allocated 
running time and the CPU power. Methods for generating 
multi-resolution control points are indicated in (Bhan 1986), 
Octree encoding methods provide very naturally for multi- 
resolution algorithms, however, we have other schemes under 
consideration because octree make no use of the coherence 
that might be present in a scene and therefore can lead to 
great inefficiencies. 


4 . Sensors 

“Model Building Sensing" is used to gather global three- 
dimensional information from the environment. In a robotic 
context, the sensors perform a “surveying” function, provid- 
ing information to be used by the path planning module. 
This is quite different from the on-line uses of sensors in 
which the local environment is continuously sampled so as 
to avoid crashes. In particular, model building sensors must 
operate over a wider range than their servoing counterparts. 

4.1 Global Sensors versus Proximity Sensors 

The chosen sensor must be either a proximity sensor at- 
tached to a “roving” arm or it must be capable of acquir- 
ing three-dimensional information at a distance. In the first 
case, the accuracy is limited only by that of the manipula- 
tor. However, control problems are likely to crop up for com- 
plex environments where concavities abound. Such problems 
arise because the environment is not known a priori: in fact, 
the environment is difficult to explore precisely because it is 
not known! Consequently, such a process is likely to be a 
slow one. 

We contend that such proximity methods are only ad- 
visable when the task environment is so intricate that spatial 
considerations prevent larger apparatus such as those we de- 
scribe below to conveniently operate. Suppose for example 
that we wish to model the bottom of a narrow, oblong cav- 
ity inside a given object. We can safely assume that our 
robot arm can indeed penetrate the cavity and orient itself 
within it, since otherwise there would be little point in mod- 
eling it. Using the very manipulator which is to perform the 
robotic task is then the most direct way to model the task 
environment. 

4.2 Acquiring 3-D Information 

Techniques developed for acquiring three-dimensional infor- 
mation at a distance are still the preferred answer to auto- 
mated model-building in most cases. These techniques can 
be either photometric or telemetric. 

4.2.1 Photometric Techniques 

Photometric techniques attempt to infer distance from pho- 
tographic images. Bui such images map intensity, an extrin- 
sic characteristic of the three-dimensional world, onto the the 
two-dimensional plane along the lines of a perspective pro- 
jection. The task of recovering the correct interpretation for 
a given image is then a formidable one since it requires that 
the perspective ambiguity (lines map into points) be resolved 
from the intensity cue alone; formally, this task consists of 
inverting an illumination-reflectance operator / which maps 
the three-dimensional scene to the image plane. The so- 
called “shape-from” techniques attempt to perform that dif- 
ficult inversion using a combination of analytical work (Horn 
1968; Horn 1975: Ferrie 1986; Levine, O’Handley and Yagi 
1973), and of higher-level cognitive processes (Rosenburg, 
Levine and Zucker 1978; Bajcsy and Lieberrnan 1976: Shirai 
1973: Nlarr 1976|. -These methods have been much investi- 
gated in part for their similarity to human visual processing, 
but also because they do not require sophisticated optical 
hardware. 
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4.2,2 Telemetric ‘Technique* 

In contrast with photometric techniques, telemetric tech- 
niques usually require specialized hardware but are much 
easier to analyze in return and therefore constitute a much 
preferable means for automatic three-dimensional scene ac- 
quisition. The goal here is to build “range images": a range 
image maps the distance of the closest point in the scene to 
every node of an orthographic grid the size of that scene. 
These images are usually constructed by monitoring pat- 
terns of points (Hasegawa 1982; Ishii and Nagata 1976), lines 
(Oshima and Shirai 1979: Sato and Inokuchi 1985), or grids 
(Potmesil 1979) of light which are successively projected onto 
the scene and reflected to a sensor located at or near the light 
emitting device (often a laser). Either positional analysis of 
the returning rays or time-of- flight discrimination can now 
be used to infer the range of the closest obstacle. In the first 
case, simple geometrical relationships relating emitted and 
returned rays yield the sought distance in a process called fri- 
angulation. In the second case, the time taken by light rays 
to travel from and back to the emitting laser source allows 
us to calculate that same distance. Needless to say, the prac- 
ticability of the latter method is limited by the very sophis- 
ticated electronics that the enormous speed at which light 
travels requires (Lewis and Johnston 1977; Nitzan, Brain 
and Duda 1977.) 

An alternative time-of-flight method uses sound waves 
instead of light rays because of their more manageable speed. 
Although simple in principle, the method suffers from vari- 
ous engineering problems such as the need for frequent recal- 
ibration. the difficulty experienced in focusing sound waves, 
as well as their hard-io-model reflective properties. 

In summary, the “safest" and most accurate methods 
of acquiring distance information seems at present to be 
triangulation. However, one should not discount ultrasonic 
time-of-flight methods which are already commercially avail- 
able. Further, many researchers believe that laser time-of- 
flight methods will soon present itself as the most viable 
method since it offers in theory the greatest absolute ac- 
curacy. The interested reader should refer to the excellent 
review by Jams {Jarvis 1983) for further reading on range 
acquisition techniques. 

5. Conclusion 

In this paper, we have presented an overview of methods re- 
lated to the collision prevention for manipulators with rev- 
olute joints. It has been shown that it is a difficult problem 
in its generality and we have proposed a computational ar- 
chitecture based on an analogy with an another domain of 
Artificial Intelligence. 
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1 . ^Abstract 


Collision-free trajectory planning for robotic manipulators Is lnvestl gated f In th te 
The task of the manipulator is to move Its end-effector from one point to another point in an 
environment with polyhedral obstacles. An on-line algorithm is developed based on finding the 
required Joint angles of the manipulator, according to goals with different priorities. The 
highest priority is to avoid collisions, the second priority is to plan the shortest path for 
the end effector, and the lowest priority Is to minimize the Joint velocity for smooth motion. 
The pseudo- Inverse of the Jacobian matrix is applied for Inverse kinematics. When a possible 
collision is detected, a constrained inverse kinematic problem is solved such that the collision 
is avoided. This algorithm can also be applied to a time-variant environment. 


2. Introduction 

Ordinary tasks for a robotic manipulator are to move its end effector from an admissible 
point to another admissible point in an environment with obstacles. For that, the initial and 
final configuration of the manipulator are often given for the trajectory planning. Usually, 
there are infinite paths for the end effector. Even for a specific path of the end effector, 
there are still infinite trajectories possible for the manipulators. However, some of the 
trajectories are not feasible becasue of arm geometry, obstacles, and some kinematic or dynamic 
constraints. Even with the kinematically feasible trajectories , some computational or logic 
problems in the algorithms may make them impractical. 


3. Trajectory Planning 

In order to move from one point to another, in the task space, one needs to solve for the 
angular information from the spatial information, using the inverse kinematic rela t lonship. 
Consider a robotic manipulator with n degrees of freedom. Let the kinematic relationship 
between Joint angles and the end-effector position and orientation be given by 

X - f (q) (1) 

where X is the m-dlmentional vector of the end-effector position and orientation, and q is the 
n-dimensional vector of Joint angles. For a kinematically redundant manipulator, the dimension 
of q is greater than the dimension of X, (n>m). Differentiating the above relation, we get 

X - J(q)q (2) 

where J(q) • df/dq is the mxn Jacobian matrix, [7]. For a redundant manipulator, the Jacobian 
matrix will have more columns than rows. Moreover, the Inverse of such non-square matrix is not 
defined in a regular sense. However, useful solutions of equation (2) can be found, by using 
the generalized-inverses of the Jacobian matrix J, and is given by 


q - J f X • (I-j r j)Z (3) 

t T T -1 

where J - J (JJ ) is the pseudo-inverse of J, I is the nxn identity matrix, and Z is an 
arbitrary n-dimensional vector. When the vector 7. is selected to be zero, equation (3) reduces 
to 


q - J f X (*) 

which gives the best approximate solution to equation (2). This is in the sense that if q« is 

the solution of (2), given by (U), then ||q«|| < ||q||, where q is any other solution of (2) 
that is given by (3), [5-6]. It should be noted that, such minimum norm, or best approximate 

solution, is defined when there is no restriction in the task space. This cleans that, in a 
restricted environment, the above mentioned best approximate, solution may not be feasible for 
application and may result in collision. 
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Let ua define collision point to be the point on the manipulator body whioh has the 
potential to collide with the obstacle. The collision-free trajectory planning problem here Is 

to develop an algorithm, for the on-line determination of the required Joint angle rates, q, for 
safe manipulator motion. The approach is to continuously monitor the task space, for detecting 
possible collisions. If no potential collision Is detected, the required Joint angle rates are 
generated, using the best approximate solution, to move the end-effector on a shortest distance. 
But when a potential collision point Is detected, the trajectory is modified in order to avoid 
collision. 

M. Obstacle Avoidance 


In order to avoid obstacles, one needs to use the kinematic relationship for the collision 
points, similar to that of equations (1) and (2). Let the potential collision point, In the 
task space, be denoted by X c< Then, similar to equation (2), we can write 

X c - J c (q)q (5) 

where J Is the mxn Jacobian matrix for the collision point. The inverse kinematic solution to 
the above is similar to (3) and Is given by 


q - J*X„ ♦ (I-J*J )Z' 
c c c c 

where J* is the pseudo- inverse of J Q , and Z* 


is an arbitrary n-dlmensional vector. 


( 6 ) 


Now. the problem or obstacle avoidance is that, when a potential collision is detected, the 
highest priority is to avoid the obstacle, and,ir needed modify the position of end-effector. 

In order for the trajectory planning to have minimum norm, we choose Z - 0 as in (4). On 

the other hand we choose Z'*0, like In (6). to account for both collision avoidance and 
trajectory planning. From (3) and (6), a minimum norm solution for Z* Is 


Z ' - [ju-Aj 1. 

C C C C 

Plugging this back into (6), we get 


Then, using the following identity, [6] 
(I-jIj KJtw !)] 1 - [J(I-J*J )] f 

C C C C C 

we get 

q - J*X„ * [J(I-J*J )] f CX-JJ&]. 

' Art C C n n 


c c J 


(7) 


The above relation, generates the Joint angle rates q such that the obstacle is avoided and the 
end effector velocity la modified, for the on-line trajectory planning. 

Now the question is how and in what direction the end effector spatial velocity should be 
changed. For the algorithm to be fast and implementable, a finite search for the minimum norm 

solution is considered. The value of X^ls preselected by the user, and the value of X 

modification is also preselected by a value of e. The value of q may now be found by examining 
seven different directions for rate modification, e.g., e-varlation in plus or minus X,T,Z 

coordinates and also no modification. The smallest norm ||q|| is then chosen for trajectory 
modification, from seven different possibilities. 

The overall algorithm is such that when no potential collision is detected, a minimum norm 

solution q is planned according to (4). But, when a potential collision is detected, the 
obstacle is avoided and the path in modified according to (7). 

In order to develop the algorithm, it is assumed that: 

(1) the solution exists 

(2) the obstacles are represented by polyhedrals. 

(3) the geometrical information about the task area is known, e.g., using sensory systems, the 
positions of obstacles and manipulator links are known. 

(4) the potential collision point on the manipulator links can be detected. 

(5) only a single collision may occur, and will be detected, at any given time. 
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5. The Algorithm 


The following summarizes the steps. Involved In the proposed algorithm, for the on-line 

collision-free trajectory planning of the robotic manipulators. The steps of the algor ltha are: 

(1 ) Determine a minimum-length path for the end-effector, froa the current position to target 
position. 

(2) Check if there Is a potential collision point. If there Is, go to step (5), otherwise 
continue. 

(3) No potential oolllslon Is detected. Hake an lncreaental move according to the Joint angle 
rates vector q, given by equation (4). 

(4) if the end-effector has not reached the target, go to step (2). If it has reached the 
target, go to step (7). 

(5) Potential collision is detected. Hake an Incremental move according to the joint angle 
rates vector q, given by equation (7), such that ||q|| Is minimized in a finite search. 

(6) Go to step (1 ). 

(7) Stop. 

6. Conclusion 


On-line, collision-free trajectory planning Is discussed. An algorithm, which utilizes 
sensed Information about the configuration of the manipulator and obstacles, Is developed based 
on the task priorities. The order of the task priorities are: to avoid collision, to plan the 

shortest path for the end effector, and to choose the minimum norm solution. The algorithm is 
fast and could be implemented on robotic manipulators Tor on-line trajectory planning. 
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1. Abstract 




Space-based robotic systems for diagnosis, repair and assembly of systems will 
require new techniques of planning and manipulation to accomplish these complex 
tasks. This pap e r summar i es results of work in assembly task representation, 
discrete task planning, and control synthesis which provide a design environment for 
flexible assembly systems in manufacturing applications, and wh ich extend to planning 
of manipulation operations in unstructured environmcnts/^ssembly planning Is 
carried out using the AND/OR graph representation which encompasses all possible 
partial orders of operations and may be used to plan assembly sequences. Discrete 
task planning uses the configuration map which facilitates search over a space of 
discrete operations parameters in sequential operations in order to achieve required 
goals in the space of bounded configuration sets. 

^ 2. Introduction 

Space-based robotic systems will be required to perform tasks involving dexterity, 
perception, and planning. Telcrobotic systems integrate human perception and human 
planning capabilities in order to accomplish tasks. Autonomous systems will 
increasingly requir^ imbedded task planning systems with accompanying sensory 
integration, control synthesis, and system architecture to support goal-directed 
activities in an uncertain environment. Diagnosis, repair, and assembly are tasks 
which will be essential t\ the maintenance of space-based systems and require both 
complex manipulation as \ell as reasoning about system configuration and system 
functionality. This paper reviews our recent work on task planning for assembly 
systems and discusses its implications for the development of robotic systems for 
assembly, maintenance, repair\and transport tasks. 

Both manned and unmanned spacecraft require a variety of maintenance and repair 
tasks including materials handling, diagnosis of faults, reasoning about the origin 
of faults, hypothesis formation testing, planning and executing repair 
procedures, disassembly, assembly, ind replacement of parts. Currently these tasks 
may be accomplished in a limited wiv by on-site manual and telcopcratcd systems. As 
the number, complexity, cost, and Vmportancc of these spacecraft increases, 
autonomous systems which can provid^gcrvicc and maintenance on a routine basis will 
become essential. 

Diagnosis and repair arc problems in reasoning as well as manipulation. Any 
successful approach to these issues requires aVcpresentation of the task and an 
automated reasoning system which enables a decomposition of the problem into feasible 
sensing and manipulation procedures. Our work\n assembly planning is based on 
several generations of assembly workcells which wb^ u ^l and demonstrated for 
manufacturing applications [I], These flexible workers incorporated multiple robot 
arms, vision, tactile and force sensing to accomplish tasks in electronic assembly, 
wire harness assembly, and assembly of instrument produces such as copiers and 
printers. 


/' /• 


\ 


\ 




,129 




Our experience with implementing tasks on these prototype workcells is the basis 
for current research on the development of tools for efficient design, programming, 
and implementation of complex systems. Task representation, decomposition, and 
sequencing [2,3,4], discrete task planning, [8] and adaptive control and learning 
techniques [9] are principal issues which are currently being addressed. Embedding 
such adaptation and learning procedures in the control and planning hierarchy is 
fundamental to successful implementation in uncertain environments. In this paper, 
we summarize an approach to assembly task representation and sequencing, and describe 
in more detail the use of the configuration map as a tool in discrete task planning. 

The control functions of the system are allocated hierarchically into Strategic, 
Tactical, Operational, and Device levels. The control synthesis problem is to map 
the control hierarchy onto the set of feasible assembly plans in order to achieve 
desired performance. In this procedure, we seek to iteratively adjust the assignment 
of system resources subject to task precedence and configuration tolerance 
contraints. This procedure requires the definition of motion strategies and motion 
primitives which can be employed. We have developed a detailed understanding of 
sensorless manipulation strategies [3,6,7, 8] which facilitate planning of sliding, 
pushing, and grasping operations. We are studying control structures for vision, 
tactile, and force feedback [9], and have demonstrated feasibility of adaptive 
control strategies for visual servoing. This work on sensor-based control is 
currently being extended to employ learning algorithms at the level of the motion 
primitive in order to improve performance by local adaptation in the face of 
uncertainty in the task environment. We have formulated an approach to quantitative 
description of task uncertainties using entropy methods [10], and have investigated 
the use of this parts entropy approach for planning strategies. We have also 
developed and demonstrated a new approach to arm signature analysis which improves 
the identification of kinematic models of manipulator structures and increases the 
resulting positioning accuracy [11]. 

Implementation of robotic systems in either a telerobotic or autonomous mode will 
require many of these planning, control, and manipulation capabilities. Task 
decomposition and control hierarchy have not been studied sufficiently for the 
telerobotic case. Development of motion primitives and planning of fine-motion 
strategies are important topics for research. The addition of adaptive and learning 
strategies to teleoperator systems is also important. The evolution of autonomous 
systems from telerobotic systems will require more effective models of human task 
planning strategies and task representation. The design of the components and tools 
of the space-based environment will depend on a consistent task representation which 
evolves to accept autonomous manipulation. 

3. Assembly Task Representation 

In our approach to assembly system design, [2,3,4], the planning of assembly of one 
product made up of several parts is viewed as a path search in the state space of all 
possible configurations of that set of parts. A syntax for the representation of 
assemblies has been developed based on contact and attachment relations. A 
decomposable production system implements the backward search for feasible assembly 
sequences based on a hierarchy of preconditions: (1) Release of attachments, (2) 

Stability of subassemblies, (3) Separability of subassemblies, including (a) Local 
analysis of incremental motion, and (b) Global analysis of feasible trajectories. 

Because there are many configurations that can be made from the same set of parts, 
the branching factor from the initial state to the goal state is greater than the 
branching factor from the goal state to the initial state. The backward search is 
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therefore more efficient and corresponds in this case to the problem of disassembling 
the product using reversible operations. The resulting set of feasible assembly 
sequences is represented as an AND/OR graph and used as the basis for enumeration of 
solution trees satisfying system and performance requirements. 

Figure 1 shows an example of an AND/OR graph representation of assembly sequences 
for a simple product with four parts. Each node in the graph corresponds to a 
subassembly and is described in the representation by a relational structure using 
the syntax of contacts and attachments. The hyperarcs correspond to the disassembly 
operations, and the successor nodes to which each hyperarc points correspond to the 
resulting subassemblies produced by the disassembly operation. For most products, 
the assembly operations usually mate two subassemblies, and the resulting hyperarcs 
are typically 2*connectors as in this example. 



Figure 1. AND/OR graph representation of assembly plans for a simple product. 
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A solution tree from a node N in an AND/OR graph is a subgraph that may be defined 
recursively as a subset of branching hyperarcs from the original graph. The AND/OR 
graph representation therefore encompasses all possible partial orderings of assembly 
operations. Moveover, each partial order corresponds to a solution tree from the 
node corresponding to the final (assembled) product. The AND/OR graph representation 
therefore permits one to explore the space of all possible plans for assembly or 
disassembly of the product. The problem of selecting the best assembly plan may 
therefore be viewed as a search problem in the AND/OR graph space, and for some given 
evaluation function on the graph, generic search algorithms such as AO* [12] may be 
used. In practice, the development of such an evaluation function is very difficult 
since it would often depend explicitly on implementation issues such as choice of 
devices and underlying control strategies. We have explored the assignment of 
weights to hyperarcs using criteria of (a) operation complexity, and (b) subassembly 
degrees of freeddm, or parts entropy [10]. Such an approach is viewed as a 
preliminary search procedure which may narrow the search space for later detailed 
examination using implementation details. In the simple examples studied, the 
resulting ranking of candidate assembly sequences was consistent with intuitive 
assessment of complexity. 

The representation of assembly plans is particularly important for systems which do 
online planning or scheduling. Previous studies of online planning problems [13] 
have used discrete sequence representation or precedence diagrams of operations. In 
the precedence diagram formalism, typically no single partial order can encompass 
every possible assembly sequence. The AND/OR graph represents all possible partial 
orderings of operations, and each partial order corresponds to a solution tree from 

the node corresponding to the final product. We have illustrated the use of the 
AND/OR graph for online scheduling of a simple robotic workstation with random 
presentation of parts [2], The resulting analysis showed a relative improvement in 
efficiency (number of operations required) from fixed sequence operation of 6% for 
precedence diagrams and 18% for the AND/OR graph. The principal advantage in this 
example was the reduced need for buffering and corresponding retrieval of parts. 

The AND/OR graph representation provides a framework for the planning and 
scheduling of operations sequences. The problems of testing, disassembly, repair, 
and assembly all benefit from a unified representation which encompasses partial 
ordering of procedures. Preliminary search of the task space may reduce the 
candidate subtrees substantially, but the development of final plans typically 
involves directly the implementation and specification of the underlying devices and 
motions. In the next section we describe a tool for discrete task planning which 
facilitates exploration of alternative sequences of operations at the level of parts 
configurations. 

4. Discrete Task Planning 

A sequence of assembly or disassembly subtasks is implemented by performing • 

operations on the parts using system resources such as robot hands, fixtures or 
sensors. The allocation of these resources and the synthesis of control programs to 
coordinate them must be developed in a second level of planning. In general, such 
operations require detailed motion planning of individual devices and is extremely 
difficult. In this section, we describe a definition of discrete operations which 
lend themselves to planning through manipulation of the configuration map relating 
input and output configuration states. 

Any subtree of the AND/OR graph may be thought of as a subtask precedence graph. 
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tad each branch of the subtask precedence graph defines a process in the 
configuration space of the parts. An assembly operation can then be defined by: 


Assembly Operation: 

Given c° - (C, # , C“) € C, 
control manipulation, tenting, and computation 
to achieve e f — (C| f , C*) € T, then 
execute operation, 

where T •“ tolerance set, 

T C C — C, x 0^ for entities i, j, 

is the set of configurations (region of configuration space (14]) for which an 
operation on i,j can be successfully performed. 

This definition emphasizes the basic problem in assembly as the control over 
configuration uncertainty in order to meet tolerance requirements of successive 
operations. While it is possible to define probability distributions over 
configurations of parts, in practice, it is very difficult to accurately estimate 
such distributions, and it is cumbersome to propagate the effect of such 
distributions through successive operations in a sequence. The configuration map 
used here provides a tool to compute the effect of operations on bounding sets of 
configuration points. 

A bounding set B(v) is defined as 
B(v) - (possible outcomes of v) 

where v is a bounded variable. We can define in turn: 

Joint bounding set: B(v t , v„ .... v # ) 

Conditional bounding set: Bfv^Vga-ii) »• {v^v^it) 6 B(v 1 ,v 2 )} 

Sum of bounding sets: A +• B —• {v|v — a+b for a 6 A, b 6 B} 

Scalar multiplication: cA = {v|v = ca for a € A}. 

An operation which alters the configuration of a part may be described by a mapping 
between the initial configuration, 6i, and the final configuration 0». An operation 
with a unique mapping occupies a single point in (C*space x C-space) and completely 
defines the change in configuration state of the system. In this case, planning of 
operations reduces to planning of unique trajectories in configuration space. As 
discussed above, such unique mappings are often of limited use due to the 
uncertain! y in configurations and the finite tolerance of operations. Then, states 
of the objects may be described by bounding sets of points in the configuration 
space. 
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The configuration map M(Ai., Bj) describes a single operation which maps a bounded 
set of input points to a bounded set of output points: 

M^.S,): {SJ - {S 2 }. 

The configuration map takes on logical values in (C-spacc x C-space) where each 
logical T defines a feasible mapping. The configuration map for a rigid part is a 
function of twelve dimensions, although in many cases these degrees of freedom are 
not of equal interest. 

The usefulness of the configuration map representation of operations lies in the 
ease of combining sequential operations. An operation M,(9i,<k) followed by an 
operation 0j) is defined as: 

9 ( ) - MjMj - U a {Mj(a, o)}. 

Sequences of alternative operations may therefore be compared using simple 
relations. 

The configuration map is particularly useful in cases where inputs and outputs may 
be partitioned into bounded sets. If we identify N subintervals B of the output 
space and N subintervals of A of the input space, then a symbolic mapping: 

M* - Uj{Aj x B. | M(*. a) > 0 }. 

defines bounded regions of the configuration map associated with transformations of 
bounded sets due to a given operation. A useful instance of the bounded set map 
occurs when we let: 

°)>U 

Then the configuration map 
M’ — U. Aj x Bj 

is rectangular and the operation is completely defined by the symbolic map and the 
definition of the underlying sets. 

The product of rectangular configuration maps is completely defined by bounding set 
operations: 

MjM 2 — Uj *Bj x {U k € 2l c . l \) 

where 


«Cj - {k| J A. n *B k *e}. 
is the resulting configuration map product. 

Figure 2 shows an example of a peg insertion operation in two dimensions. This type 
of problem has been studied from the point of view of trajectory planning in 
configuration space [15]. The configuration map shown in figure 2 is derived from 
such a trajectory analysis and summarizes the input-output relations in a manner 
which permits the resulting discrete operation to be integrated into task plan. A 
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different configuration nap is developed for each set of discrete operations 
parameters, and the ability to form configuration map products permits search over 
the space of operations sequences. In figure 2, the x position of the peg is 
regarded as the independent variable of the map, and the initial imposition of the 
peg is fixed for a given configuration map. The operation moves the peg in a •* 
direction using a compliant move and directional uncertainty represented by the 
velocity cone [16]. 

The resulting configuration map in figure 2 has three output bands corresponding to 
successful insertion, miss-to*the-left, and miss-to-the- right. These three bands 
occur consistently for different parameter values. Five input bands may then be 
reconstructed and labelled defining a partitioning of the input configuration space. 

The resulting map may be 'rectangularized* as shown by the dotted areas, and in that 
form the symbolic mapping provide a complete description of the operation and a basis 
for search procedures. 


CONFIGURATION SPACE 




Figure 2. Configuration map for peg in hole example. 
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An example of a product of configuration maps is shown for a different set of 
operations in figure 3. Each of these maps is derived from our analysis of sliding 
objects [3,6,7] and corresponds to the orientations of a polygonal object being 
pushed by a two-dimensional fence of finite length. Equivalently, the object may be 
moving on a conveyor belt past a fixed fence. The independent variable in each map 
is the object orientation while the operation parameter is the fence angle. The 
uncertainty represented by the finite width bands in the maps is a result of the 
unknown support distributions of the objects. In [3,6,7] we derived bounds on the 
rates of rotation of such objects and have used these to compute the configuration 
maps for this example. The product of configuration maps therefore defines the 
bounds on the sets of orientations resulting from successive fence pushing 
operations, and can be used as a planning tool for designing sequences of fence push 
operations to achieve required goals. 

For discrete tasks, the space of all operations sequences may be represented by a 
tree. Arcs correspond to operations, and each node represents a set of possible 
configuration states after execution of all the operations on the path from the root 
to that node. Figure 4 illustrates one such tree which corresponds to sequences of 
fence pushing operations for fences of different angles operating on the object shown 
in figure 3. The possible configurations of a part at a given node are obtained by 
multiplying the configuration maps for the operations on the path from the root to 
the node. Traversing the tree in order to search it is facilitated by the ease with 
which products of multiple configuration maps can be computed using the code sets. 




Figure 3. Product of two configuration maps. 
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Figure 4. Tree search for operations sequence. 



Figure 5. Resulting sequence 
of fence push operations. 


Each node is labelled with the subset of the indices j of B for the bands B for the 
fence angle of the preceding arc. The goal of this task was to reduce the set of 
possible configurations to a narrow range of orientation, and a search strategy was 
implemented to reduce the number of output bands to one using the minimum number of 
operations. 

Searching this tree of discrete operations exhaustively is computationally 
difficult due to the high branching factor which results from the available set of 
fence angles at each step. Two techniques have been developed to make this search 
feasible. First, there are systematic relations among bands for different operations 
parameters. Since there are only a few distinct code sets for the output arcs, it is 
often possible to systematically choose the subset of arcs which need to be followed 
among these outputs. Second, branches of the tree which develop code sets which have 
occurred previously in a shorter route may be pruned during search. 

Implementation of these search techniques permits solution of the fence sequence 
design problem with the resulting design shown in figure S. This parts feeder design 
will align parts with the geometry shown in figure 3 independent of the input 
orientation. Bounds on the orientation of the resulting single band are also derived 
from the procedure. The output part is then aligned for acquisition or handling by a 
robot. Computation for this search problem requires a few seconds of computation. 

5. Conclusions 

In this paper, we have reviewed several results in assembly representation, 
discrete task planning, and their relation to underlying control strategies. These 
methods of planning and manipulation are important for applications which will 
require autonomous systems to carry out complex tasks in diagnosis, repair, and 
assembly in space. The development of such analytical tools and their demonstration 
in prototype systems will be an important part of the evolution of tclerobotic and 
autonomous systems for space applications. 
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»es H»e interpreter of a task level robot programming system called 
Handey/fiandey is a system that can recognize, manipulate and assemble polyhedral 
parts from given only a specification of the goal. To perform an assembly, Handey makes 
use of a recognition module, a gross motion planner, a grasp planner, a local approach 
planner and is capable of planning part re-orientation. The possibility of including these 
modulesjn-a telerobotics work-station is discussed. 
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2 Introduction 


The projected increase in the use of robots in space will make their increased autonomy 
essential. Direct teleoperation of robots in complicated, repetitive tasks, such as those 
found in space, can be very tedious. Robot autonomy would relieve the operators from 
unnecessary fatigue as well as improving reliability and cost [1]. 

One step towards improving the autonomy of robots consists of having a system 
capable of planning simple grasp and assembly operations. This goal seems to be a 
fairly simple and short term objective and yet, it has only been achieved for very well 
structured environments. The early research on automatic planning of robot operations 
[2,3,4] focused exclusively on simple situations involving completely modeled environ- 
ments. 

More recent work has now made it possible to design systems wQrking in much more 
general environments, including environments with significant uncertainty. These envi- 
ronments resemble the type of environment one can expect to find in the vicinity of a 
space station. This type of environment can include complex parts and six degree-of- 
freedom revolute arms, all of it modeled with a CAD system. In this paper we describe 
Handey, a new system that embodies many of the fruits of this more recent research. 
While Handey still makes strong assumptions about its environments and tasks, we be- 
lieve that these assumptions are realistic enough so that Handey can contribute towards 
improving the tele-operation of manipulators. 


3 Handey overview 

Handey is a task-level [5] robot programming interpreter, that is, the commands given 
to the system are not robot motions or gripper operations as in VAL or LM [6,7] but 
simply by describing a certain desired state of an assembly. For example, a full sequence 
of robot motions, gripper operations and sensor calls can be replaced in a task-level 
robot programming system by a single statement: PLACE PART A on PART B. The 
interpreter is responsible for planning and carrying out the detailed motions and other 
actions which lead to this assembly. In its current stage of development, however, 
Handey makes use of the ’’perfect world” hypothesis and so, does not take in account 
problems related to uncertainty or unexpected events. For example Handey does not 
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Figure 1: Experimental setup 


include a compliant motion planner which would plan assembly strategies in the presence 
of uncertainty [8,9, 10], and does not provide program verification techniques based on 
uncertainty propagation to patch a predefined plan [11,12]. This remains as future work. 


Figure 1 represents a scene used during the development of the system, the doted line 
on the table shows the limits of the field of view of the laser range finder (this area is 
called the V-area in the rest of the paper). 

Part A is assumed to be located in the V-area. Nothing is assumed concerning this 
area: part A can be in any location and it can be partially obscured from the range 
finder by other objects. Except for part A it is not necessary for parts entirely located 
in the V-area to be modeled in the CAD system. The location of part B is assumed to 
be known, as are the locations of obstacles in the workspace outside of the V-area, such 
as the laser-camera device, which we plan to use in the future to find the exact relative 
position between the gripper and the part. 

The user describes the final assembly with a set of relationships between geometric 
features of parts A and B, then he activates the interpreter. The following is a typical 
sequence performed by the interpreter to plan the detailed motions and operations 
necessary to achieve the assembly. 

Determining the Final Location. Based on the specified symbolic geometric rela- 
tionships between parts A and B, a geometric transform representing the relative 
location between A and B after the assembly is computed. 

Recognizing and Localizing Part A. A model-based vision algorithm is executed 
to determine the location of part A. 

Planning a Grasping operation. The grasp planner first tries to find a grasp which 
permits placing part A directly on part B. If this is not possible, a regrasping 
operation will be planned later. 

Planning the first gross motion. A collision- free path is planned from the initial 
position to a point on the boundary of the V-area. 

Planning a collision free approach. Since the scene in the V-area is not modeled 
in the CAD system, it is not possible to find a collision free path by the method 
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used for gross-motion planning. Another planner, using the data provided by the 
range finder, plans a path for the gripper among the obstacles which are located 
in the V-area. 

Planning re-orientation. In many occasions it is not possible to grasp and to assem- 
ble the part keeping the same relative position between the part and the gripper. 
In this case a regrasp operation is planed in a obstacle free portion of the work 
space. 

Planning the remaining gross motions. The regrasp planner produces a number of 
intermediary locations to be reached by the robot, during the re-orientation phase, 
this phase computes all the paths necessary the perform the regrasping and the 
path to the final destination. 

4 Functional description of Handey 

Handey is composed of several modules, most of these modules correspond to substantial 
pieces of code. 

4.1 Experimental Environment 

The hardware-dependent software primitives provide a way for the planner to ignore 
the details needed to operate real-world equipment. It is crucial that these modules be 
quite good, since they determine the overal system’s precision in localizing and moving 
parts and, as a consequence, will determine the success of the experimentats. Handey 
makes use of a very limited number of such hardware dependent primitives. 

• Range finder calibration: this primitive eliminates non-linearity due to the technol- 
ogy of the laser sensor and scanner hardware. It also determines the scale factors 
between the sensor and the model. 

• Robot Vision calibration: this primitive determines accurately the relative location 
between the reference frames associated with the robot and the range finder. 

• Depth map acquisition: this primitive activates the range finder and returns a 
depth map in standard units. 

• Joint motion: in its current version Handey makes use of one robot motion primi- 
tive: ’’MOVE-JOINT TO (ql,q2, q6)” to command a coordinated motion of the 

robot. The gripper is operated in a binary mode. * 

4.2 The world modeling system 

The world modeling system is used to construct polyhedral models of the parts involved 
in the assembly including the obstacles (table, ceiling, etc.) and the robot. It is also 
used to maintain a model of the world during the planning. Once geometric models have 
been created it is possible to use the following primitives to create, modify or interpret 
a scene: 

• assign a location to a part, 

• express the location of a part in a different reference frame, 

• affix and unfix parts from the gripper of the robot 
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A of Part A is Parallel to Face A of Part B 
C of Part A i» Against Face C of Part B 
B of part A is Against Face B of Part B 


the final assembly 



Figure 3: Depth-map produced by the laser range finder 

4.3 Computing the final location of part A 

The user can describe symbolically the next assembly step by specifying a set of geomet- 
ric relationships which should hold between geometric features of part A and geometric 
features of the sub-assembly. Figure 2 represents the set of relationships used to describe 
the final location of part A. 

The set of geometric relationships is then translated into a set of algebraic equations 
[13]. The system then sol vest he the set of equations to compute the relative position 
between the part and the sub-assembly. At the present time this feature is not integrated 
into the on-line version of Handey but works separately. 

4.4 Locating part A 

The range finder is activated and produces a depth map (figure 3). The map is then 
processed as if it was an image except that the brightness corresponds directly to the 
elevation above the work table. 

A standard edge operator [15] is run over the image and extended linear segments 
are identified in the resulting array. Note that this process identifies 3D edge segments. 
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Figure 4 Matching model edges with scene edges 


not just their projection in an image. The method used for object localization is a 
simple hypothesize- verify algorithm based on matching linear segments in the depth 
map to edges in the polyhedral model of the part. This method is a variation of the 
method described in Lozano-Perez and Grimson [16] using edge data instead of face 
data. Figure 4 represents one matching between edges of the scene and edges of the 
model. 

4.5 Planning collision-free motions 

At a number of points in the operation of the system, a collision-free path is required 
from one specified location to another. Handey uses a simplified version of the path 
planner described in Lozano-Perez [17]. This path planner uses the robot's joint space 
as the configuration space. The version of the path planner used by Handey never 
computes configuration spaces of dimension greater than three, but it allows motions 
requiring six degrees of freedom. Essentially, we assume that a path from the start to 
the goal exists such that the last three joints of the arm retain their starting values until 
some intermediate point where they change their values at the goal and never change 
after that. It is easy to construct cases where this assumption will fail, but it works in 
a large percentage of actual cases. 

The actual planning proceeds as follows: An approximate arm model is built in which 
the last three joints are replaced by a box. This box must be large enough to enclose the 
last three links, the hand and any object in the hand, not only at their start and goal 
position but also at the intermediate positions between the two. The three-dimensional 
configuration space for this model can then be built. We find the closest free point in 
this configuration space to both start and goal positions, a path is then found between 
these two free points. Note that the complete robot is guaranteed to be safe along this 
path, for the whole range of values of the last three joints between the start and the 
goal. Therefore we can simply interpolate the values of the last three joints between the 
start and the goal values. Then, we plan a path using the original model of the robot 
between the free point and the start point itself. We also plan a path from the free point 
closest to the goal itself. In these two paths the value of the last three joints are fixed. 
The concatenation of these three paths form the desired path. Figure 5 represents the 
path found the final motion. 
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Figure 5 Example of * path generated by the path planner 



Figure 6 Back-projection of parts 


4.6 Grasping 

Once the part has been located Handey chooses a grasp. This operation has to take in 
account several constraints: 

• the grasp should be stable, 

• a path should exist inside the V-area to reach the grasp, 

• the grasp should permit assembling the part once it is in the gripper. 

The last constraint can be satisfied by “back projecting’’ all the obstacles in the 
V-area. After this operation virtual obstacles exist in the V-area, these obstacles have 
the same relative location with part A that the real obstacles will have in 'he final 
sub-assembly. If one can find a grasp in this environment then it is guaranteed than the 
grasp will permit assembling the part (figure 6). 

4.6.1 Finding a stable grasp 

In its current version, Handey uses a grasp planner designed for a gripper equipped with 
parallel jaws. A future version of Handey will include a more sophisticated planner 
designed for the three fingers JPL-Stanford-MIT hand [14]. Currently, the planner 






Figure 7 Grasp-poinU usociated with one face 



Figure 5 Angular range associated with a grasp 


associates two grasps for each locally convex edge of the model. A grasp is defined by 
one of the face adjacent to the edge and a grasping point. The grasping point is located 
on the face at a prespecified distance from the edge. Figure 7 represents all the grasping 
points of one face of part A. 

To be valid a grasp should satisfied three conditions: 

1. a parallel face should exist and should permit a grasp (mutual visibility ; 18 j) with 
an allowed distance between the two faces. 

2. The gripper should be capable of some rotation around the grasping point (grasping 
range), 

3. an inverse kinematic solution should exist at the grasping point. 

The grasping range can be computed using a submodule of the path planner. The 
grasps are sorted with such grasp permitting the most vertical approach. Figure 8 
represents the gripper into two end-points of an angular range. 

4.6.2 Planning motions in the V-area 

Since no information on obstacles exists in the world-modeling system for the V-area. 
we must rake in account the presence of objects reflected in the depth map. For this 
purpose we use a planner specialized for planning the motion of the hand in the grasp 
plane. The grasp plane is a plane parallel and at equal distance from the two faces 
defining the grasp. When approaching a grasp the fingers remain parallel to the grasp 
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Figure 9: PUnning a pith in the grasp plane 


plane and centered about it but, otherwise, are free to rotate and translate in the plane. 
Obstacles are projected into this plane to reflect the possibility that they collide either 
with one of the two fingers, the cross-piece of the hand, or the force sensor. 

The planner uses a method loosely modeled on the potential field method for obstacles 
avoidance [19]. The goal of the grasp planner is to bring a gripping point located between 
the fingers as close as possible from the gaping point without colliding. The grasping 
point attracts the gripping point of the grippir while projected obstacles on the grasping 
plan repel the boundaries of the projected gripper parts (fingers, cross-piece and force 
sensor). These pseudo-forces are combined in such a way that the gripper is guided 
toward the goal in X,Y,0 on the grasp plane. The initial position and orientation of 
the gripper is given by the grasping planner. Figure 9 represents the evolution from the 
initial position toward the goal. 


4.7 Regrasping 

Back-projected objects are artificially added to the depth map so that the final grasp 
will also permit the assembly. However this may constrain the problem so much that 
a feasible solution cannot be found. Handey will backtrack among sorted grasps a 
limited number of times before giving up and trying to find a solution with regrasping. 
The V-area path planner is then called without back-projected parts and a regrasping 
operation is planned. 

For each part the grasp planner uses two data structures: placements and grasps. 

!. A placement is a way of placing the part at a particular location on the work table 
This location is chosen in an area known to be free of any obstacle, the regrasping 
will take place in this area. A parameter ^ is associated with each placement P. 
Changing this parameter corresponds to rotating the part on the table around a 
vertical axis. All the placements P t are computed automatically by computing the 
stable faces of the convex hull of the part. 

2. A grasp is defined by a parameter 6 associated to each grasp G. Changing this 
parameter corresponds to rotating the gripper along an axis perpendicular to the 
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grasping face and containing the grasping point. The set of grasps G 2 is also 
computed automatically. 

In order to plan a regrasping operation it is necessary to compute all the vertices of 
the “regrasping graph”. A vertex consists of a data structure defined by a pair P X G } 
having a non-empty By map. A map is built by sampling y and B over the interval 
(0.0, 2m). Each y,B specifies a single position of the gripper. To be valid, a pair should 
correspond to a position of the gripper where a solution of the inverse kinematic exists. 
The map is the set of all the valid By pairs. 

There are two operations necessary to perform a re-orientation. 

1. Moving from one placement P t to another P*. This is possible when a grasp G ; 
exists, such that the maps associated with P X G } and PkG } have at least one valid 
By pair. 

2. Changing from one grasp G } to another grasp G\ . This is possible when a place- 
ment P, exists such that the map associated with P,G : and P x G k has at least one 
valid By pair. 

The regrasp planner is given an initial position of the part inside the gripper and 
a final Gf.yj grasp which permits the assembly operation. The goal of the regrasp- 
planner is to find a way through various placements and grasps between the initial and 
the final grasps. This is represented in figure 10. Horizontal arcs in the figure represent 
motions of the part from one placement to another and vertical arcs represent motions 
of the gripper to change the grasp. 

5 Applications to Telerobotics 

Using telemanipulators in earth orbit has long been recognized as a difficult task. The 
trend has been to increase the level of commands available to the operator [20] . Proto- 
type telerobotics workstations have been built integrating such high level teleoperation 
commands. For example, hybrid-control permits the computer to maintain a drill on a 
given axis while the operator can concentrate controlling the force necessary to perform 
the drilling operation. 

Based on our experience we believe that it is not unrealistic to add some of the capa- 
bilities of Handey to such a work-station. As explained in 4.1 the number of primitives 
used by Handey is fairly limited and should be available in such a workstation anyway. 
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The most limitating factor being the possibility of including a range finder on the mobile 
remote servicer (RMS). Once integrated, one can imagine to use a system such Handey 
in three modes. 

• autonomous inode: The operator describes the next assembly step. The system 
computes the sequence of operations and sensor calls to perform the assembly a 
graphic simulation is presented to the operator before actual execution. 

• partially automatic mode In this mode the operator asks the system to plan 
certain portions of the assembly, for example, the system can plan the trajectory 
to align two axes so that a drilling operation can take place under the control of 
the operator. 

• monitoring In this mode the operator first describes what result he expects to 
achieve. 'The system monitors the task and sends an alarm when it detects that 
the present configuration of the system makes it difficult or impossible to reach 
the goal. For example grasping a part in a way that the final assembly or an 
intermediate path is difficult or impossible. 

6 Conclusion 

Watching Handey performing an assembly is always astonishing and fun, no operator 
would ever program a task the way our system does. Potentially, we believe that future 
versions of Handey could be more efficient in performing assembly tasks than typical 
operators. It could plan paths more effectively in term of time, energy, and safety. It 
would be less likely to make a mistake such as grasping a part and not being able to move 
it at a later stage of the assembly because of mechanical stops or collisions. Handey is 
based on well-establish geometric principles which can make it a robust system. For this 
reason, it is possible to think of the Handey interpreter as a target system for higher- 
level planners. The current Handey implementation on a Lisp Machine is still quite 
slow; it takes approximately 10 min to plan a single pick and place operation. But, we 
believe that is possible to reduce this time significantly simply by reimplementing it in 
a machine with fast floating point hardware. 

Telerobotics is often presented as feasible alternative to an infeasible autonomous 
robot. This is certainly true at the present time, but the contrary may be true in the 
future, that is, the technology necessary to achieve good tele-presence may be more 
sophisticated than the technology necessary to provide on-board intelligence and dex- 
terity. 
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Manipulator Control: An Overview 

H. Seraji 
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Pasadena, CA 91 109 

The next generation of robot manipulators will exhibit rudimentary intelligence by ac- 
quisition and perception of sensory data and operation in partially unknown environments 
with some degree of autonomy. One of the crucial components for realizing these capabili- 
ties is a sophisticated manipulator control system. Consequently, research in advancing the 
control of manipulators is an essential step towards the development of future robots. 

The topic of “Manipulator Control” was chosen as one of the central themes of the 
NASA Workshop on Space Telerobotics. Six sessions containing 38 papers were devoted 
to manipulator control. In the course of presentations and subsequent panel discussions, it 
became evident that the topics under manipulator control may broadly be classified into two 
categories. 

The first category contains topics which are largely resolved by now, such as advanced 
control methodologies for single-arm robots in free motion. Theoretical research in these 
areas has been pursued actively during the past decade and has reached a modest level of 
maturity. Nevertheless, there are very few practical implementations of the advanced control 
techniques, even at the academic level. It was generally felt that more emphasis should be 
placed on implementing advanced control methodologies on robot manipulators. 

The second category covers those topics which are partially known at the present time. 
These include force control, coordinated multiple-arm control and control of redundant and 
flexible arms. Although some partial results have been reported on these topics, considerable 
research is still much needed. For instance, in the area of force control, when the robot makes 
contact with the environment, dynamic models which adequately represent this interaction 
are not yet completely developed. In a similar manner, optimum coordination and task 
allocation among multiple cooperative arms are not yet resolved. It was generally believed 
that support of fundamental theoretical research on these topics is currently needed. 

In summary, the manipulator control sessions at the Workshop were successful in pro- 
viding a forum for technical interaction among researchers in robot control. Furthermore, 
the findings of the Workshop were beneficial to the robotics community and in particular to 
NASA in enhancing its perception of manipulator control technology and in identifying the 
directions of future research and development in robot control. 





/>: •••- 


Adaptive Force-Position Control for Teleoperated Manipulators 



A.J. Koivo 

.7 . : " / S 


Purdue University 
Lafayette, IN 47907 

P Pi 5' ^ J 

c f • 

Abstract 



An adaptive controller with self-tuning can be designed for teleoperated robotic manipulators by ^ 
determining a time-series model for the function of the teleoperator. Specifically, the position and force I 
exerted by the operator are modelled for determining the derived values for the trajectory of the end- I 

effector of the manipulator. Thus, the adaptive controller can be designed by following the steps which I 

have previously been presented for the controller design of the gross motion. ) 

1* Introduction 

A teleoperated robotic manipulator refers usually to a system in which an operator equipped with sufficient sen- 
sors, effectors and computer intelligence can make the manipulator perform complex tasks either under human supervi- 
sion or autonomously. The human operator supervises the robotic system which is performing low-level tasks by inter- 
mittently monitoring and/or reprogramming the computer. Thus, the telcoperator can increase the level of intelligence 
of the overall system. 

The teleoperator functions in the system as the "master" and the manipulator as the "slave". The teleopcrator is 
mainly interested in the motion of the end-effector, and not so much in the motion of the intervening segments. 
Although the control of manipulator motion is commonly performed in the joint space, it can also be accomplished 
directly in the Cartesian coordinate system [5], If the motion of the master (the teleoperator) is described in the Carte- 
sian world coordinate system, the slave can be made to follow the motion of the master by controlling the motion of the 
slave (manipulator) in the Cartesian base coordinate system. In the preliminary work described here, we will use this 
approach to control the force exerted by the end-effector of the manipulator, and its gross motion. We will construct an 
adaptive self-tuning controller for the control of the force and position of the end-effector. 

The overall system is Grst described briefly, and the problem formulation is given. A time-series model for the 
motion of the teleoperator is then developed. This model is used to predict the desired motion of the manipulator. An 
adaptive controller is then designed to make the manipulator follow the desired motion without the supervision of the 
operator. 

2. Teleoperated Robotic Manipulator System and Problem Statement 

The overall system consists of a robotic manipulator with an end-effector, computer, and a teleoperator, whose 
arm and hand are constrained to have the same configuration as the manipulator. The hand is assumed to possess two 
(jaw-like) fingers. It will be assumed in this preliminary study that the position of the hand as well as the force and 
moments exerted by the hand can be measured. 

The measurements of the position of the hand and the forces (moments) exerted by the hand on the object will be 
described as a multivariate discrete time-series model. The parameters of this model are estimated recursively by the 
least squares error method on-line. The resulting model will be used to predict the desired values of the variables for 
controlling the manipulator, and its end-effector. 

The dynamics of the manipulator will also be modelled by means of a multivariate stochastic discrete time-series 
equation with unknown parameters. This vector difference equation is used as the basis in designing an adaptive self- 
tuning controller for the manipulator motion. 

The problems to be considered in the following consist of constructing (i) a discrete time-series model for the 
desired values of the position and forces for the end-effector; (ii) a multivariate auto-regressive (ARX) model with exter- 
nal inputs for designing an adaptive self-tuning controller for the dynamics of the manipulator. The difference equation 
model for the desired values of the position and force are based on the measurements which become available when the 
teleoperator performs a task (i.e., teach-by-doing). The ARX-model is determined on the basis of the measurements 
available from the position sensors and the force sensors of the manipulator. We will assume that the forces exerted by 
the end-effector on the object axe “soft", i.e., that they can be modelled by linear springs. 

2. Mathematical Model for Teleoperated Manipulator System 

In order to make the end-effector follow the path determined by the teleopcrator, and exert the force (torque) 
specified by the same operator, the values of these variables will be measured as a function of time. The future values 
of these variables can be predicted by using a time-series model constructed on the basis of the measurements. Suppose 
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that the teleoperator exerts a force r*(k) at time kT (T= sampling period) on an object, while following a trajectory 
passing through the points p d (k) expressed relative to a chosen Cartesian world coordinate system. These values can be 
modelled by time-series models: 

f'W-Aj + 2JAf ^(k-IJ + ^k) (1) 

1-1 

P d (k) - Bt + £ Bf p d (k-i) + f| d (k) (2) 

i-i 

where the equation error is signified by £ d (k), and q d (k); the unknown parameters in the matrices A d , and B d , j =* 
0,1,... ,n are estimated by the least squares error method on the basis of the measurements. 

The one-step ahead predicted values for the force f 1 and p d are computed by the following equations: 

r*(k+l|k) - ki + S A? f(k-i+l) (3) 

1-1 

p d (k+l|k)-Bi + EB?p J (k-i + l) «) 

i-l 

where the terms on the right are known from the measurements and the calculations of the parameter estimates. 

To construct a controller for the manipulator, an ARX-model is used as the basis of the design. If the position of 
the end-effector relative to the Cartesian base coordinate system p(k) at time kT is measured (e.g., encoder readings), 
while it exerts a force f(k) on an object, then an ARX-model for the measurements can be written as: 

tn 

P(k) - C 0 + £ Cip(k-i) + C^k-lMk) (5) 

i-i 

'00 - D 0 + £ Djf(k-i) + E,ii(k-l)+« a (k) (8) 

i-i 

where the modelling errors are denoted by e t (k) and e 3 (k); the unknown parameters in the matrices Cj, and j, j = 0,...,m 
are estimated recursively on line by the least squares error method on the basis of the available measurements. 

The desired values for the position and force are related to the values p d (k) and f*(k) determined by equations (l) 
through (4) for the teleoperator. By applying appropriate coordinate transformation, which relates the Cartesian world 
coordinate sysjtem used in describing (p d (k)} and {r(k)), to the Cartesian base coordinate system, the desired values 
{p (k)} and {f (k)} expressed in the base coordinate system can be determined. 

Having obtained the desired values for the manipulator motion, an adaptive self-tuning controller can next be 
designed. It is determined by minimising the following performance criterion: 

Mu] =■ E{||p(k + i)-p d (k+i|k)||^ ) +||r(k+i)-7 1 (k+i|k)||l + || u (k)||?,} (?) 

where the matrix S represents the selection matrix for determining when the force-servoing or position-servoing will be 
used. The norm refers to the usual generalised Euclidean norm. The expectation operation is conditioned on the avail- 
able measurements. 

The problem is solved by minimising I k [u] in equation (7) subject to the plant equation constraints given by equa- 
tions (5) and (6). 

The minimising controller u (k) is specified by the following equation: 

G . (I-S)(C 0 + £c,p(k+l-i)+G , u’(k)-p d (k+l |k)]+ 

j-t 

+ Eis[D 0 +2D i f(k-i)+E 1 u'(k)-f d (k+l|k)]+Ru*(k) - 0 (8) 

i-l 

Equation (8) can be solved for the control input u*(k) in the feedback form. It should be observed that the desired 
trajectory must also be updated according to equations (1) through (4), with the model parameters. 

Simulations studies are currently being conducted to demonstrate for the feasibility of the approach. 

4. Conclusions 

An adaptive self-tuning controller design has been presented for the operation of teleoperated manipulators. The 
approach is first to model the function of the teleoporator. Then, the controller design is performed on the basis of the 
desired trajectory determined. Simulation studies are currently being conducted using the approach. 
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toofifitivi 4ul-ira robot*. Is the poai t ioa-poti t loa eoatrol 
adapt It* controllers ensure that the end-effector positions of both irai 
daalrad trajectories ia Cartesian apaoa despite unknown tta«-fi 
interaction forces atartad through the load. Ia the position-hybrid ot 
strstsgy, tba adapt It* eoatrollar of oao iri ooatrola end-effector motic 
tba fra# directions aad applied force* ia th# eoaatralat diractioaa; vhil 
adaptiT# eoatrollar of tba other ara aaaarai that tba aad-affactor t 
daalrad poaitioa tra jaator laa. Ia tba hybr Id-hybr id eoatrol stratag] 
adaptiva eoatrollara aaaara that both amd-affactora track rafaraaea pot 
trajaetoriaa while aiaml taaeously applying daalrad foreai oa tba load* ] 
tbraa control atrataglee. tba eros a-eoapl lag affacta bctvaaa tba ara 
traatad tc ^disturbances* which ara rajactad by tba adaptive ooatrollara 
following daalrad eoaiaadi in a eoaaon fraaa of rafaranoa. Tba ada 
eontrollart do sot require the ^bapTez^aatheaat leal aodal ot tba ara dye 
or any knowledge of tba ara dyaaalo paraaatara or tba load paraaatara ai 
aan aad stiffness. Tba eoatrollara hew# s lapis atrnotiraa mi 
coapatat tonally fait for on-line iaplaaantatioa with high stapling rates 

1 • Iatrodoct ion 

Daring the pest daeade. robot aaaipnlators bnva baan ntilisad in industry for parforaing staple tasks, aad 
it la foraaaan that in tba naar future anthropomorphic robot a will raplaoa huaan operators in oarryiag ont 
warioui eoaplaz tasks both In industry aad ia hazardous snTlroaaeats. Nevertheless, present-day robots earn be 
considered at best as "band icapped” operators due to tbair aingle-ara structure. It la evident that 
multiplicity of robot eras yields greater dexterity and ineraaaad efficiency and provides tba oapability of 
handling larger loads. Dual-ar* robots will therefore have capabilities which aay natch those of aabideztrou* 
baaan operators la deztsrlty aad efficiency. 

Tba research on dual-ara robots la at its early stages at the present tlaa, and a faw approaches ara 
csrrantly available. Nakano at el. [I] propose a method for control of dual-ara robots in a aastar/alava 
manner. lahida [2] considers parallel aad rotational transfer of loads naing dual-ara robots. Fuji! tad Cnroao 
[3] suggest a technique for dnal-ara control based on the method of virtual rafaranoa. Alford aad Belyen (4) 
describe a aatbod for coordinated ooatrol of two eras. Zhang and Lab (5,61 obtain oonatrained relations and 
control laws for two coordinated ara*. Tarn at al. f7] employ tba exact linearization technique for dual-ara 
control. Bayati [Si proposaa a method for controlling dual-ara robots based oa partitioning tha load betwaaa 
tba aras, Koivo [9] suggests an adaptive control technique for dual-sra robots using the self-tuning approach. 
Lia and Chyung [10] describe a positional control scheme for two cooperating robot eras. 

The control architecture considered in this paper is baaed oa the tri-level hierarchical ooatrol of dual- 
ara robots as shown in Figure 1. In this tri-level control architecture, the high level plans the task to be 
performed and decomposes the task into appropriate snbtaaks for the right and left eras. In the intermediate 
level, each anbtask is tranaforaed into a sequence of aynebronoua desired trajectories of end-ef factors motions 
and applied forces. The low level is concerned with the ezecutioa of the desired trajectories aad employs 
feedback from the current status of the aras. In this tri-level hierarchy, the low level "achieves” the 
desired aotlon and operates In "millisecond” tine-scale, the intermediate level "determines" the notion desired 
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for III iiluik it i»4 tha kick lml "pitas" tha uktiik immbcii is v il»iti a Um-miU 

Tko frmit 9 *ft <mrlWi Uni aoatrol slfit«|lci toe lovlml ilifllvi aoatrol ot iMfmtlv* dmal-aro 
robots aalag n««ak rmlti oa HipIlM aoatrol of ila|l«-«ra robot! (21-19). Ia took ooatrol analogy, a 
sal table task-related eeerdlaate frtao of refereaeo it ehooea for both am aad tba desired •otloae iai applied 
foraaa for oaah tra ara expressed la thla aoaaoa rafaroaoo fraao. Thoa oaab ara olll am aa tkoagh It van 
oarrylag oat tho ooaaaa4t4 aotioa by ttaoif ia tho rafaroaoo fraaa. Tfco adaptive aootrollora aaaara that tba 
aoatrolla4 variables traoh tho 4oaira4 rafaroaoo aoaaaa4a aa4 rajaat tho aawaata4 4ftat«rbaaaaa aaaao4 by 
lataraatloa foraaa aa4 torqaes aaarto4 throagh tho lead. 

Tho paper fta atraotaro4 aa follows. Ia faatftoa 2, tho pesitloa-positiea aoatrol atratogy la dlseassed. la 
Sootloa I, tho positioa- hybrid aoatrol strstsgy la developed, Tho hybrl4-hybrl4 aoatrol atratogy ia a44roaoo4 la 
Sootloa 4. Sootloa 5 41aamaaoa tba raaalta of tba paper sad 4rava aoaa eeaelasioas. 


2. MlUtiAlillaAsUtl lumn 

la thla aaatloa. va aboil lavestigste tba flrat aoatrol atratogy for 4aal-ara aaaipalators ia vhlah both 
araa art la para poaitloa aoatrol, aa shova ia Figaro 2. Ia otbar vor4a, tba poaltloaa aa4 oriaatatioaa of both 
aa4-affaatora ara reqalred to traak 4aalra4 trajaatarioa la a aoaaoa fraaa of rafaraaoa. Ia tbia aitaatioa. 
aaooatrolla4 foraaa aa4 torqaes vlll ba aiarta4 oa tba aoaaoa loa4 baU by tba aa4-affaatoro. la tbia see ties, 
va lavestigste tba porforaaaaa of tba poaitloa aoatrol ayotoaa ia tba faaa of tbo lataraatloa foraaa aa4 torqeis 
aaorta4 throagh tba load. 

2.1 failtloa ftiiiailfi tax llihl/1 ill Ata 

Tbo 4yaaa ia ao4al of aaah oaalpalator ara aaa ba raproaaata4 bp a 41ffaraatlal sqaatioa ia Cartaaiaa apaao 
aa (14) 

M(X)X o N(X,X) ♦ 0(X) ♦ B(i) ♦ f - F (1) 


vhoi 


if la 


X.X.X - asl vectors of ead-effeetor poaitloa, velocity aa4 

acoalaratioa la a flsa4 taak-ralatad Cartaaiaa fraaa of 
rafaraaoa 

F “ axl vector of "virtsal" Cartaaiaa foraaa appliad to tba aad- 

affaotor aa tba ooatrol iapot 

M(X) - aaa ayaaatrio positlvs-dif iaita Cartaaiaa sail aatria 

N(X.X) « ul Cartaaiaa Coriolis aa4 oaatrifagal foraa vaotor 

C(X) • axl Cartaaiaa gravity loadiag viator 

B(X) - axl Cartaaiaa frlotioa foroa victor 

f • axl victor of forcaa a ad torqaaa axortad by tba sad-effietor 

oa tba load 

Tba force/ torqm victor f both iaparta aotioa to aad appliaa forca/torqma oa tha load aad acts as tha coapliag 
alaaaat batvaaa tha two araa. Ia tha folloviag aaalyaia. tha foraa/torqaa victor f vlll ba coasidarad aa a 


”dlstar baaea iapot" to tha poaitloa coatrol systsa. Tha faactioa of tha ooatrol syataa is to sasara that tha 
amd-sffactor poaitloa victor X tracks tba axl victor of daslrsd trajactory X d daspita tha distorbaaca foroa f. 
For sach ■aaipalator ara, lit aa apply tha liaaar adaptive poaitloa coatrol lav [12] 

F(t) - d(t) ♦ (Kpd) K ( t ) ♦ L r (t)B(t)J ♦ (C(t) X d (t) ♦ B(t ) i d (t) ♦ A(t) Z d C t ) 1 (2) 

aa shova la Figaro 3, vhars B(t) - X d (t) - X(t) la tha axl poaitloa trackiag-arror vaotor. Ia tha coatrol lav 
(2). tha axl victor d(t) is aa aaxlliary sigaal to ba ayathasixad by tha adaptatioa aehaaa. vhlla H p B ♦ I V B) 
aad (CX d ♦ BX d ♦ AX d ) ara tba ooatr ibat loss daa to tha faadback aad faadforvard ooatrollars raapao t i valy. 
Folloviag tba aathod daaoribad ia lafaraaca (12), tba raqairad aaxlliary sigaal aad ooatrollar galas ara apdatsd 
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n«oHU| to Uf follow 


d. 


i 


1 


t 

*(*> - 4(0) ♦ », / r(t)4t ♦ » 2 »<t> 
t 

* f <t> > 1,(0) ♦ aj / r(t) B* (t)4t ♦ # 31 ( 1 ) *'(») 
t 

*,(») • 1 ,( 0 ) » % x l r(t) B’(t)4t ♦ Oj»(l) B'(t) 
t 

C(t> - C(0) ♦ / r(t) XJ(t) 4 » ♦ r 2 *(t) XJ(») 

B(t) ■ >(0> ♦ n /*»(») *i<*> 4 * ♦ Tjr(t) X 4 U) 

*(») - 4(0) ♦ Xj / t(t) Xi(») 4 ‘ ♦ Xjr(t) 14 (f) 

vitro 


( 3 ) 

<41 

(3) 

<*> 

< 7 ) 

(I) 


r<t) - f f E(t) ♦ I(t) <9) 

it aa nil victor, (6 l# yj, t|* kj) are positive sealers, lij, ^2 # * 2 * Tj* 9 0#ltlT# or 

sealers, tod tic priae domotot t r tot poai Horn. U c^utiot (9). f p tad l t art ui constant v«i|Ml«s aetrieea 
ehosea b y the doaiftor to raflaot tk« rclittvi significance of tho po.itioa tad velocity orrora B and 1 It 
■mat bo aotod that tiaoo wo oaaaot physically apply tho Cartotiaa ooatrol foroo F to tko tad- of factor, wo 
laotoad ooapaco tbo aal eqoiveleat joiat torqae vector T to effectively oaato tbit foroo. Thao, for fttgh 
■aaipalator ora* tbo ooatrol law la joiat tpaoo it given bp 


T(t) - J ’<»> F(t> - J’(»> ( 4(t) ♦ Kp(t)B(t) ♦ C,(t)B(t) ♦ C(t)X d (t) ♦ B(t)i 4 (t) ♦ A(t)X 4 <t)l (10) 

where 9 it tho ail vector of Joiat aagwlar potitioaa aad J(9) la tbo aaa JaeobUa aatrix of tho aaaipalator ita. 


Booaato of tho siapiioity of tho adaptatloa lava (3) - (I). tho robot ooatrol algorithm eaa bo laploaioatod 
at tag high sampling ratoa (typically 1 Us). Ia each taapliag period (** latte), tko ooatrollor galat oaa ohaago 
• iga i f ioaat ly; whereat tho teraa M. N, 0, B, aad f la tho robot aodol (1) oaaaot ohaago aotlooably. Aa a 
ratal t. ia deriving equations (3) - (I), It vat attaaod that thoto teraa are aabaova aad "slowly tlae-vary lag" 
relative to tho adaptatloa law*. It it teea that tho laoiatioa of tko dlttarbaaco force f la the robot aodol 
(1) doea aot of foot tho coatrollor adaptatloa lift alaoo tho ohaago la f over oao taapliag period It relatively 
•■all. 


Tho above argaaoat aaggoata that vhea both aaaipalator araa are controlled voiag tho two JiJiuiilll 
adaptive poeitioa coatrollert, we expect tho osd-eff retort to track tho dealred potitioa trajectorlot doaplte 
tbo interact ion force* aad torques exerted throogh tho load. It aaet bo aotod that alaoo tho forco oa tho load 
ia aot a coatrolled variable la thia scheae, thia atratogy caa load to madoolrablo load force* whoa tko poeitioa 
trajectories are aot plaaaod ia eoordiaatioa or are aot tracked cloaoly. 


3. P9flU9B-glVf id CpBUftl 3 1 X1.1111 

Ia thia section, tho poai t ioa-hvbrid control strategy for dael-ara ■aaipalator* will bo atadlod ia which 
tho loft tra ia ia pare position control and the right ara la ia hybrid pot 1 1 ion/ for co control, aa shown ia 
Figaro 4. Ia other words, for tho loft ara, tbo tad-effector position ia required to track o desired trajectory 
ia a front of reference. For the right ora, ia tko eaao reference frcao, tbo contact force between tho end- 
effector and tho load anat bo controlled ia tbo directions constrained by tbo load, while tko ead-effoetor 
poaitloa ia to b# controlled a iaal taneonaly ia the free directions. Thia ooatrol strategy can bo applied when 
one robot era la confined to operate only in position control aode whereat the other era con be controlled ia 
hybrid control aode. 
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l.i ftiliin QwlxtUix At UA An 


carted llrM|l III Ui4 in «iui4i 
R 4iitiflm«i,* i»4 III ilifthi fiiHlii aoatrol ijitia m ititn truklai if Hi lulu! paaitiea 
triJiiiMin Infill ml IlituliMii, at ntliail U Intiit 2. Hi iliftivi fiiltlii mini lav for ill 
laft ara aim U Pifin I la |iti« by 111] 

T t (t) - *((•(> (4(t) ♦ I^d>t(t) ♦ I,d>iu> ♦ ?<t)X t4 (t) ♦ 7(01(4(1) * I<t)i t 4<»)) (11) 

tkifi T( t« til a«l J«lil l«i|M fHioi, #( t. tk* ail jolat aagl* /((•(> •• ,k * aaa Xt.okl.a aitili, 

1(4(1) la 111 ail finir of liainl Cartaalaa positlea trajsetary, B(t) * l(|(t) - X(t) la tha ail faaitiai 
trsekiag-errer vaetor aa4 tka terms la a^aatioa (11) ara adapted aa follova: 


„ t 

4(0 -4(0) ♦ *! / r(t)4t ♦ *j»(t) 

(12) 

fyt) - f_(0) ♦ «, / , r(t)S’(l)4t ♦ • 3 f(l)S'(t> 
w w 0 

(13) 

t,(») -1.(0) ♦ lx / * r(»)S'd)4t ♦ 4 2 <(t)S , (t) 

(14) 

7(t) -*<0> ♦ / ^d) I( 4 <t)dt ♦ »jr(t) Z( 4 <t) 

(IS) 

t 

1(1) -f<0) ♦ T( / f(C) X(4(t)4t ♦ Tjrd) X^d) 

(id) 

- - ‘ 

A(t> - A(0) ♦ / rd) X ld d>4» ♦ X 2 r(t) Xt 4 (t) 

(17) 

. 

r(t) - o^mo ♦ 9 r id) 

(11) 


tad Ik# symbols ara daflaed la Seotioa 2. 

3-2 Bxfexii Coaugllfi lax tiiM kin 

Va Skill aov diioaat tka hybrid posit &oa/ forea ooatrolltr for tka ri|kt ara. Coaslder a taak-ralatad 
"coastiaiat fria«" (coord i ante aystea) thick la daflaed by tka partioalar eoataet aitaatioa oecerriag betveeathe 
rlgkt ead-ef factor aad tha load. la tkia fraae. tka a degraea-of-fraedwa (or diraotioaa) ia tka Cartaalaa apaaa 
(X) oaa ba partitionad iato tvo ortkogoaal aata; cka a ooaatralat diraotioaa ia aabapaoa (X) aad tka 1 fraa 
diraotioaa la aabapaoa (T), vitk a • a ♦ l. Ia tka a ooaatralat diraotioaa. tka aad-af factor aakas ooataot with 
tka loMd aad tka saalAMl lata*. ittdi to m aoatroilad. Ia tka 1 fraa diraotioaa. tka aad-affaator ia fraa to 
aova aad tka ftliUlilllDX fflililO i* to ba ooatrollad. Ia tka hybrid control archltaotara (16.171. tvo 
aaparata controllers ara employed for aiavitaaaoaa forea aad posit ioa ooatrol. Tka "force eoatroller* aehtevas 
tracklag of daairad forea aatpoiata ia tka ooastraiat diraotioaa; vkila tka "poaitioa eoatroller” aoeoapliahes 
tra eking of daairad poaitioa trajaotoriaa ia tka fraa diraotioaa. 

Tka dynamic aod«l of tka rigkt ara ia tka ooastraiat diraotioaa eta ba vrlttaa aa (13) 

A(X.i) P(t) ♦ BfX.i) Pit) ♦ ?(t) ♦ C p (7) t f* * V° (i9) 

vkara P f ia tha ail "vlrtnal" Cartaalaa forea vaetor appl lad to tka aad-affaotor ia tka ooaatralat diraotioaa, Z 
ia tka axl vector of tad-effector poaitioa. tka ara aatrlcas A aad B ara kigkiy complex noallaaar faaotioas of 
tka aad-affaotor poaitioa X, C p ii tka oroaa-eoapl i eg from tka poaitioa loop ia tka force loop aad f g is tka 
poapoaant of tka forea aiartad oa tka aad-affaotor by tka load la tka ooaatralat diraotioaa. Tka tara f g 
raprasanta tka oroaa-eoapl lag tkat exists betvsaa tka arms tkroagk tka load aad ia eoasiderad as a *dia tarbasee” 
to tka kybrid coatroilar. 

In a raoaat papar [13], aa adaptive foroa ooatrol schema la davalopad vltkia tka kybrid ooatrol 
areki teotmro. For tka rigkt ara, tka llaaar adaptive force ooatrol lav la tka ooaatralat diraotioaa is givta by 
(13) 
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r,<«> - r # u> ♦ 4(t) ♦ ijtt) *<*» ♦ *i<«) t ■ «,<*>*<*> 


•• ikm ia Fi|ui 5, tlMtt F f (t) it tk« 4,tlr*4 tntiit fare* aa tka laa4 aaa4 aa a fa#4faraar4 tara. 4(t> tc 
„ (aail iary atfaal. I(t) - F,(t) - Kl> la tka 4aaUtlaa at tka aataal faaaa P(t> fraa tka 4aalca4 aalaa. aa4 


(4 f (t). K|(t)> 4,(0) 

a4afta4 aa fallava: 


•liar. Tka 


1 la* (M> 


4<t) - 4(0) ♦ * t / «<t)4t ♦ *j«<t) 
0 


C x (t) - 4j(0> ♦ a x / »(t) <t)4t ♦ ajO(t) «•'(!> 


4 f (t5 - 4,(0) ♦ / «(t) 4* (t)4t ♦ Sj«<0 ■*( 


4,(0 - 4,(0) - ri / <(t) Z’(t)4t - TJ0<»> X*(t) 


*(0 • *i 4*( t) ♦ 1,4(0 - *,2(0 


la aaaatiaaa (21>-(2J). BOO • / B(t)4t It 

0 


(4j. *|, ri> ara paaltiat taali 


U 2 , ,j, pj, TJ ) art potitl.t or aaro aaalara. aa4 (*j. *.. t v l ara aaaataat «ai|ktla| aatrloaa akaaaa by tka 
4atl|aar to refloat tka ralatloa aifaifloaaaa of 4*. B aa4 2. 

Tho dyaaalo aod.l of tko rl|ht ara la tka frao Blraatleaa oaa bo arlttaa aa (IS) 


* (X.i) 5(t) ♦ B # il.i) i(t) ♦ C 0 (I.i)T(t) ♦ C f (P> ♦ ty * V tl 


vhiri f it tk« coapoaent of tko •td*«f(«otor foreo ia t)i* fm direetioaa, Cy it the croet-eoapllng froa tko 
forot loop* A 0 , B 0 , C 0 trt oonplex aoallattr aitrloot, T it tko end-effector potitioa vector tad F y i* 
"olrtaol" oad-offootor ooatrol foreo. For tko ri»ht ora. tho llaaar adapttvo pooltloa ooatrol la* la tba frao 
diroctloat It givoa bp 

F y (t) - 3(t) ♦ t p (t) Kp(t ) ♦ f T (t) Bp( t ) ♦ 2(t) Kf) ♦ Kt) i(t) ♦ X(t) i(t) (27) 

tt ia Soctioa 2. where t it tho dotirod potitioa trajectory. Bp - I - T it tko potitioa tracking-error, tad V 
it tko "vlrtaal 1 Cortotiaa foreo la tko froo dirootloat. That, ia order to iapleaoat tko foroo tad poaitio 
ooatrol lert (20) tad (27) it tho hybrid ooatrol architecture, tko Joiat tptco ooatrol low for tko right ara it 
given by 


f ' \ 

r r u> - J'<» r ) 

\ Hy<*> / 


tt ahowa ia Figaro 6, where i f it tho joiat taglt vector, T f it tko joiat torgae vector, tad J r it the Jacobian 
aatrii of tko right era with appropriate reordering of eolnaat of J f if tocetttry. 

Tko hybrid controller adaptation loot (J)-(l) tad (21)-(24) are extreaely tiaplo, tad therefore tko control 
algorithm eta be iapleaoated atiag high aaapliag ratet (z 1 EBt); yielding in proved perforataco porticalarly la 
force control tppl let t ions. Since in etch ttapllng period i aeoc) tko terat in the robot aodela (19) tad 
(26) cannot change noticeably, it it rettoaable to ttaaae that theta terat are "alowly tine-varying* ooapared to 
tko adaptation tekoae. That tko laolaaloa of the dittnrbaace forcot f, tad tj ia tko robot aodela (19) and (2d) 
doet not affect tko controller perforataco. 

Vo conclndo that neiag tko poaitioa-hybr id control atrategy, the left oad-effector will traek the dotirod 
potitioa trajectory deapite the iateractioa forcot throagh the load. The right ead-effector will eaert tko 
detirtd force oa tko load ia certain dlreetiona while tianl taneontly tracking tko dotirod potitioa trajectory la 
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ts ■ 


tk« ortho fatal 4inttiou. It n*t ba m(W tkat is tbit aoatrol atratafy. all|bt flaataatloaa u; ba okiinii 
•* (wm 4m ta *«r aaall vlbratioaa of tha laft a to m4u potitloa aaatrol. 


la tfcla aaatiaa. the hybril-hybril aaatrol atratagy for laal-ara aaalralatera *111 bo stalled U ahiak kotk 
araa art la kybril post tiea/foro* aaatrol. at akaaa la rigara 7. la atkar tadi, la a aaaaaa fraaa af 
rafaraaaa, fa* botk araa. tka faraaa aaartal by tka tal-af fsttors aa tka loel la tka aaaatralat liraetteaa (Z) 
■••t ba aaatrallal, aklla alaaltaaaeaaly tka aal-affaatora ara taftlral ta traak la a Irak yaaltlaa trajaotorlaa 
la tka fraa llraatioaa (T). lay aaaaatal faraaa aal torqaat aa tka laal (aaaratal by tka ralatlra yoaitiea aal 
MlaaUtlaa of tka aal-affaatara *111 art aa 'littarbaaass," aal tka alaytlra kybril aaatrallara aaaara tkat tka 
laalral posit iaa/ ferae trajaatarlaa ara traakal laaylta aaak llatarbaaaaa. . 


4.i Hun flmaiiu lu Kiiki/Ufi Au *■ 

Fella* lag laatlaa 1. far aaak aaalyalatar ara tha kybril yaaltlaa/faraa aaatrol lav la tka jalat spaa* aaa 
ba vrittaa aa 


/v«\ 

- JMd) I ) 
\F y u>/ 


•koto 1(1) It tko Jtooklaa stir la (wltk appropriate oolmaa reorder i*$ If Mtitury), aa4 F t (t) u4 F y 
•wlrtaai" Carter ita foroet tpplied to tko tad-offeetor la tko coaetralat direetioat (Z) tad frit diroi 
respectively. Am tkowa la Figaro 9, tko foroo ooatrol law it glvea ky 


<t) aro tko 
etloat (I). 


V«> • *r<*> ♦ ♦ *r<t) / «,(t)dt ♦ ^(01,(1) - (JO) 

ahora P r (t) la tka laalral fore* aatpoiat, 8,(1) - F r <t) - F,(t> It tka forca traekiaf-orror aal tka alaptat* 
lowt tro: 


d<t) * 4(0) ♦ 6 1 *(t)dt ♦ 


*t<t) * f ! (0) ♦ «! / q(t> Bj'(t)lt ♦ «j,(») gj'(t) 


*p<t> * Bp<0) ♦ >1 / q(t) Bt( t)lt ♦ l2«(t) Bj(t) 


V*> - t^O) - ti / q<t) Z’(t)dt - y 2 q<t> V (t) 


«(t) - W lB J(,) ♦ *pB c (t) - p^zct) 

# t 

^ *»(t)lt aal (fj. fp, l,| ar* laalral vaightiag aatrieaa. 

Tko potitloa coatrol low it otpretted «t 

r,(t) - i(t> ♦ Kp(t)Bp(t) ♦ E^oiyU) ♦ ?<t>B<t) ♦ I(t)i(t) ♦ I<t)i<t) on 

wkoro *(t) It tko dotirod potitloa trtjectory, H y <t) * *<t) - T(t) it tko potitloa treeklag-error tad tko 
odoptotloa lowt tro: 


d(t) • d(0) ♦ 5« / r (t)dt ♦ ? 2 r(t) 
1 0 * 


164 


_ t _ 

!,(*> - 1,(0) ♦ tit) B£(t)4t ♦ , 3 f(l)l}(t) 

t,(*> ‘ V®> ♦ M / # **(*> «y< *)4* ♦ 7 2 '<t)if(t> 

t(*> - 1(0) ♦ / *,(t) t' (t)4t ♦ Kf(t)«'(t) 

0 

I(»> -1(0) ♦ rj / *!(*) i*(t)<» ♦T 2 *(t)i'(t) 

0 

1(0 - 1 ( 0 ) *\ I **(») rout *xl((oi , (t) 

0 

vk*r« 

-<*) m ^tMf(t) ♦ "? v i T (t) 

ia4 in desired v«l|ktii| 

Ill i!«yi ooatrol ler UiptitUa lavs are utrmly staple. sad Uanfori, tki hybrid ooatrol algorithm oaa 
ki iapleaeated asiag ki|k stapling ratal (* 1 Di), yielding iaproved performance. Urn da r tka adaptive hybrid 
ooatrollers. both oa4-ef factors ara expected to exert tha daairad forces oa tha load vhlla aiaal tamaoaal y moving 
oa desired trajectories. Tha hybrid-hybrid aoatrol strategy is aoat aaitabla vhaa aiaaltaaaoaa aoatrol of both 
posit loa aad foraa ia reqmired. 


5. MlHlliiLHl CtMllliMI 

TVraa adapt Its aoatrol atratogiaa for cooperative daal-ara robots ara daaaribad ia this papar. Ia thaaa 
strategies. aaah robot ara ia ooaaidarad a sabsystea of tha total syataa aad ia aoatrollad ladioondsatlv aaiag 
aa adaptive ooatrol lar ia tha low level of tha ooatrol hierarchy. Bach ooatrol lor aaaaraa that tha controlled 
v ariablas follow daairad ooaaaads aad raj sot aavaatad cross-conpliag affacta froa othar sebay stems vhioh ara 
traatad aa "distarbaaees.” Tha aabayataaa ara ooordiaatod throagh trajaotory gaaaratora ia tha iataraadiata 
level. vhara ayaohroaoaa desired trajaotorioa for both araa ara apooifiad ia a Qftp qp p task-related fraaa 6f 
reference. Aa iaportaat faatara of tha praaoat approach ia that tha overall ooatrol syataa for N cooperative 
araa ia radaoad to N daeoatral isad iadapaadaat single-ara ooatrollara. Tha ooatrol sohaaaa do aot raqairo 
eoaaaaiea t loa aad data ozehaags aaoag individual ooatrollara. vhioh is aa appeal iag faatara froa both 
oowpmtatioaal aad ral lability poiata of view. Fmrtheraore, availabla taohaiqaaa for aiagle-ara ooatrol oaa bo 
atilisod dirootly ia aaltiplc-sra environments. 

Tha ooatrol atratagios dasoribad ia this papar do aot rsqairs tha kaovlsdgs of tha load paraaatora aaeh as 
■ass a*d stiff mass or tha robot dynamic parameters amok as limk masses aad laartlas. aad oaa tharofora oopo or 1th 
aaoartaiat loa or variations ia tha syataa paraaatora. Fartheraore. tha soaplos dyaaaio aodal of tha araa la aot 
aaad ia gaaaratiag tha ooatrol aotloaa. Tha ooatrol sohaaaa ara vary siapls aad aztraaaly fast for on-liae 
iaplsasata tioa vith high aaapl iag rates, yialdiag iaproved dyaaaio parforaaaoa, Tha ooatrol aathodology 
dasoribad ia this papar oaa also be atilisod ia tha ooordiaatod ooatrol of N-ara robots vhaa N exceeds two. 

d. AfhlPTlfditll 

Tha research dasoribad ia this papar was performed at tha let Propel tioa Laboratory. California laatitato 
of Taohaolog y. eadar eoatraot vith tha National Aoroaantios aad Space Ada ini sirs tioa. 
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Abstract 

Using manipulators with a fixed configuration for specific tasks is appropriate when the task requirements are known 
beforehand. However, in less predictable situations, such as an outdoor construction site or aboard a space station, a manipulator 
system requires a wide range of capabilities, probably beyond the limitations of a single, fixed* configuration manipulator. To fulfill 
this need, wG have been working on a Reconfigurable Modular Manipulator System (RMMS). 

Unlike conventional manipulators with fixed configurations, RMMS win jtitize a stock of interchangeable link and joint 
modules. Given requirements such as the workspace, dynamic ataur apff and the payload required to accomplish a task, the 
RMMS will design the most appropriate manipulator configuratioryMlect suitable modules form the inventory, generate an 
assembly procedure, configure the controller, and finally apply the/lsultam manipulator to the task. In this way, the RMMS will fill 
a far wider requirement space than any single manipulator. \\j6 also lnhe)bntly easy to maintain and transport, since it can be 
readily assembled and disassembled. 

<7 **-c r - -» V ^ ' 

Whave designed and are constructing a prototype RMMS. The prototype currently consists of two joint modules and four link 
modules. The joints utilize a conventional harmonic drive and torque motor actuator, with a small servo amplifier included In the 
assembly A brushless resolver is used to sense the joint position and velocity. For coupling the modules together, we use a 
standard electrical connector and V band clamps for mechanical connection, although more sophisticated designs are under way 
for future versions. The joint design yields an output torque fo 50 ft-lbf at joint speeds up to 1 radian /second. The resolver and 
associated electronics have resolutions of 0.0001 radians, and absolute accuracies of ± 0.001 radians. Manipulators configured 
from these prototype modules will have maximum reaches in the 0.5 to 2 meter range. 



The real time RMMS controller consists of a Motorola 68020 single-board computer which will perform real lime servo control 
and path planning of the manipulator. This single board computer communicates via shared memory with a SUN3 workstation, 
which serves as a software development system and robot programming environment. We have designed a bus communication 
network to provide multiplexed communication between the joint modules and the /computer controller. The bus supports 
identification of modules, sensing of joint states, and commands to the joint actuator. [This network has sufficient bandwidth to 
allow servo sampling rates in excess of 500 Hz. \ ^ 


1 . Introduction 


Many applications of robotics in space will be different from the typical applications found in industry. Unlike conventional 
industrial manipulators -vluch work in a precisely known and controlled environment, a space robot is envisioned as a backup 
astronaut, performing construction, maintenance, and ex pen mentation. Such a robot must be capable of a wide range of tasks, 
from small scale, high precision operations such as replacing electronic components in faulty equipment, to immense scale, such 
as assembling room sized structures into a space station. Many of these tasks will be poorly defined or completely unexpected, 
particularly maintenance operations. 


It is inconceivable to develop a single manipulator wh*r:h meets even those requirements we can predict as necessary for a 
space manipulator. The level of dexterity and versatility r'ovicied by current manipulator technology is sufficient for only very 
constrained or well known operations. Nor is it practical, given the expense of transporting material to space, to maintain a large 
number of different manipulators at potential task sites. Our solution is the development of a manipulator system, which can be 
readily adapted to meet the individual constraints of each particular application as it occurs. 
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Designing s manipulator for a single, specific task is conceptually the same process as that employed now for conventional robot 
applications. However, at present the process of designing and manufacturing a robot system for a particular application to 
generally too tong to be practical for the highly variable and urgent tasks that arise during space missions. To achieve the desired 
range of robot capabilities, streamlining and automation of the system configuration process are required. 

In order to explore tnis approach, we are currently designing a Reconfigurabie Modular Manipulator System, or RMMS. Tba 
RMMS is a collection of manipulator components or modules (links, Joint*, actuators, and end effectors), with e wide range o* 
performance (length, strength, torque, speed, resolution, efc) utilizing common electrical and mechanical interfaces. Thisdesifln 
allows a large number of different manipulators to be assembled, at the task site, from a small inventory of components. In parallel 
with the development of reconfigurabie hardware, a similar software effort Is underway to automate the generation of servo- 
controller and path planning algorithms, provide a simple means of programming the manipulator task, and actually synthesize e 
workable manipulator configuration for a given task. Such a manipulator can thus be custom tailored to perform a specific taefc, 
and then broken down and re* used in a different configuration when a new task arises. 

In a sense, the RMMS concept is an extension of conventional interchangeable manipulator end tools. To date, however, tfre 
major efforts In this area have been in the development modular hardware, so that a robot manufacturer can produce varlou* 
manipulator configurations from standard sub assemblies [ij. or to allow remote maintenance of manipulators In hazsrdous (# 0 . 
radioactive) environments (2). In contrast, the aim of our RMMS effort is to develop a manipulator system which is modular and 
reconfigurabie by the user at the task site. 


2. Design Philosophy and Implementation 

An RMMS consists of the same major subsystems as those found in conventional manipulators: 

• A pnysJcal structure made up of joints and links. 

• Servo systems lor each joint, consisting of actuators, transmissions, and sensors. 

• A computer controller and programming environment. 

The maior difference between an RMMS and a conventional manipulator are the standardized component interfaces. This 
includes the mechanical mating of manipulator modules, the lonnat of data communication, the communication protocols between 
hardware and software, and between various levels ol software. Although adopting such standards impose inherent restrictions 
on the design of the actual components, this disadvantage is far offset by the interchangeability of manipulator components and 
the capability for rapid reconfiguration. In the following subsections, we discuss the conceptual design of each major component 
and interface in the RMMS we are developing, and the actual implementation in the prototype system. 


2.1 . Link and Joint Modules 

The mechanical modules making up an RMMS are divided into two groups, joints and links. Links are simply structural elements, 
and joints are servo mechanisms, made up of sensors and actuators. When we represent the kinematics of a manipulator by a 
series of transformation matrices representing its links and joints, links have fixed transformation matrices, while joints have 
variable ones (a function of the joint variable). Electrical power is bussed and communication is multiplexed over a small number 
of conductors permanently installed in each module, allowing for simple assembly without custom cabling. 


One implication of this modular joint design i5 that the entire joint actuator must be packaged within the joint module. Each jo«nt 
module must include a motor (or some type of actuator), a transmission mechanism, a position sensor, and the necessary power 
electronics to control the motor. Although these design constraints limit the power which can be generated by the joint due to ttie 
limiied size of the motor, transmission, and power amplifier, this is not viewed as a major short comming of the design, particularly 
for space applications. By properly selecting tho transmission reduction ratio, high torques at low speeds can be obtained, which 
is appropriate for manipulating massive objects in near zero gravity (and also on earth) as long as speed of operation is not critical. 



Figure 2*1: Modular Joint Assemblies 
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For simplicity and convenience, we are considering only the two common types of revolute joint In our RMMS. These two types 
v e rotate, and pivot, and are distinguished by the orientation of the joints link axes with the joint axis. Both types of joint are shown 
schematically In Figure M. A rotate type joint has link axes which are co- linear with each other and with the joint axis. A pivot has 
Hnk axes which are both perpendicular to the joint axis. 

Our current design for a pivot joint Is shown in the photograph in Figure 2 2, and In the section drawing In Figure 2. The joint 
actuator is a conventional servo motor and linear amplifier driving a harmonic drive with 100:1 reduction ratio. This dealgn yields • 
maximum output torque of flOfWbf, and maximum axis speed of 1 radian /second. Also integral with the joint assembly is • 
brushless resolver mounted coaxially with the output shaft, providing position feedback with an accuracy of ±0.001 radian. If 
lower position resolution Is acceptable, lower resolution (and less expensive) sensing electronics can be installed In the joint 



Figure 2*2: CMU RMMS Prototype Pivot Joint 



Figure 2-3: Section View of an RMMS Joint 

module. A wire windup allows the resolver (and output shaft) to turn up to 460° before damaging the resolver electncal 

connections All of the actuator components are packaged in a sub assembly of the joint module, allowing a number of different 
types of module to be based on common parts. The total weight of the joint is 25 lbs. A more compact and lighter version of this 
joint, as well as several kinematic variations on this basic design are currently under development. 


173 






2.2. Joint • Link Interface 

In order to assemble the joint and link modulo* into a manipulator, a method of mochanlcally coupling the modulo* la required. 
Thla coupling must both align the module*, and lock them together with sufficient strength to transmit the Internal forces 
generated by the movement of the manipulator. In addition to structurally coupling the modules together, this Interface must also 
electrically couple the modules, and be able to sente the coupling orientation of successive modules. 

The current interface design Is shown In ths photograph in Figure 2-4. An arrangemsnt of pins tf)d holes Mmit the coupling 
orientation to four, equally spacad positions. An LEO In on# flange and four phototransistora In the other allow the controller to 
sense which of the four possible orientations Is in use. A commercial V band clamp couples the two flanges together. As shown m 
ths figure, the same coupling flange it an Integral part of the link modules. Although rudimentary, this design provides the 
necessary functionality for the module Interface. However it is not easily operated. Future versions will make use of either quick 
release V-band clamps, or a more sophisticated design with an automated locking mechanism to allow automatic "peg in-hole* 
type coupling. 



Figure 2-4: Prototype Module Interface 


2.3. Communication Interface 

As mentioned, each iomt will contain the power and sensor electronics for the actuator In order to control the jomt actuator* 
and obtain sensor feedback, a communication link between the ioi.it modules and a computer controller is required, in order to 
allow standard connectormg between joint modules, this communication link must be implemented using a fixed number of 
conductors, yet be capable of supporting an arbitrary number of modules. This implies a multiplexed communication link, similar 
to a computer bus or LAN. 

Due to the high overhead associated with existing LANs, our prototype utilizes a bus type implementation. The design is shown 
schematically in Figure 2 5 The bus design is based on a conventional 8 bit bi directional data bus, an additional 5 control line*, 
and a rather unconventional 4 bit daisy chained address bus. The daisy chained address bus provides automatic node addres* 
configuration, that is. the first module in the manipulator is node address 1 . the second module is node address 2, and so on. This 
is accomplished by including a "subtract one" circuit in each module which is in the path of the node address lines. Each joint 
can thus detect 'address equals zero" as the node address. Due to the low data rate of the bus (current bus clock is 500 KHz), the 
propagation delay added by the subtract circuit is negligible. 

2.4. Software Controller 

In order to augment the capabilities of the RMMS hardware, a reconfigurable control and path planning algorithm are required. 
These algorithms would allow the control computer to automatically synthesize a control program for a given manipulator 
configuration and task requirement. This entails automatically deriving the manipulator forward and inverse kinematics, and 
inverse dynamics given the kinematic and dynamic parameters of the modules. This is one of the major research area of our 
RMMS project. 


m 






Figure 2-0: Schematic o> RMMS Computing Architecture 
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2.3. RMMS Computing Archltocturo 

Tho reaMme control software In our RMMS runt on i dedicated c ontroittr CPU . with a hardwaro interface to tho inter-modulo 
coutnHJfiictftfton network. This conUolta CPU wilt perform the necessary realtime uontiol of the mamputalur. wxf receive 
commands from a second, metier CPU . The intent of this architecture is to have the manipulator controller appear as s periphery 
In a standard micropro c e ss or system. Communication between the master and controller CPU will be via a number of memory 
mapped control and status registers maintained in shared memory on the controllor CPU. The allows the controller CPU to be 
readily integrated Into any master CPU system sharing the same system bus. 

The present implementation of this architecture is shown in Figure 2 6. The controller CPU is an Ironies stogie-board computer, 
based on s Motorola 66020 processor and VME bus. with 1 MByte ot dual ported RAM. The master CPU is a SUN3 workstation, 
also based on the Motorola 66020 and VME bus. The similarity between the two machines allows us to use the same editor end 
compiler for both processors, simplifying software development and mter- processor communication. The interface to the 
manipulator communication network is via the VMX bus interface included on the Ironies. The VMX bus is s recognized extension 
to the VME bus, intended as a local \0 bus in multiprocessor systsms such as this. 

3. Span of Possible RMMS Configurations 

In order to illustrate the range of capabilities of an RMMS. we have estimated several performance specifications tor alt of the 
manipulators possible given a reasonable size inventory of components. This set of specifications is used to dassrfy the possible 
manipulators into groups with simitar characteristics. By noting the number of different configurations within each class, we con 
gam an appreciation for the versatility provided by an RMMS. 

3. 1 . Module Inventory 

Based on our current design effort, let us consider an example RMMS with a module inventory consisting of: 

• 25 foints, consisting of 5 sets with maximum joml torques of: 10 Mm. 30 Mm, 60 Mm. 120 Nm and 200 Nm. Each set 
consists of 3 pivot and 2 rotate joints. 

• 12 joint position sensors, consisting of 3 each of the following resolutions. 10 bit, 12 bit 14 bit and 16 bits. 

• 20 links, consisting of 4 each of the following lengths: 0. 1 m, 0.2 m, 0 5 m, 1 0 m and 2.0 m. 

3.2. Obtainable Configurations and Range of Specification# 

To limit the scope of this analysis, let us make the following assumptions: 

• The manipulators wiH be 3 degree- of freedom robots, made up of revolute joints. and writ end in a conventional three 
axis (rotate pivot- rotate) wrist. Thus the final joint of the three axe manipulator should not be a route joint. as this 
would be redundant. 

• Two typee of revolute joint assemblies will be considered: pivot and rouf \ wf^ch ire shown in Figure 2- 1 . 

• Consecutive pivot joints must have either parallel or perpendicular axe or tarnations (in terms of the Oenavtt- 
Hsrtenburg convention a • 0° or 90°). 

• The orientation of the manipulator with respect to the world wilt be ignored. 

Within these assumptions, there -e seven basic meaningful manipulator configurations, which are shown in Figure 3* 1 . Within 
each of the seven configurations, numerous combinations are possible by using links of assorted lengths and snorted joint 
module designs, resulting in a wide range of manipulator performance. 

The manipulator specifications we have considered in comparing the obtainable manipulator configurations are reach, tip force, 
and bp position accuracy. The specifications were defined as follows: 

• Reach: length of the manipulator when extened We use the sum of the link lengths as an estimate. 

• Tip force: the minimum of the three joint torque capacities divided by each joints longest moment arm. Measured in 

Newtons. 

• Accuracy: the root mean square value of the uncertainty in tip position due to the error in measuring the joint 

positions. Measured in meters. 


176 




Figure 3* 1 : Meaningful Manipulator Configurations 


We have calculated the number of obtainable arm configuration* with vertoua combinations ol tip force, accuracy, and reach. 
The results ae given at histogram* in Table 3*1. which show how the obtainable configurationa are distributed over the range of 
specifications. 


We can interpret the histograms in Table 3-1 by considering the daeeification of typical robot tasks and performance 
requirements shown tn Table 3-2. Ignoring speed requirements (which are generally imposed by economic thru-put conaninta 
rather than the ability to perform a task) we can sse that the obtainable range of manipulators spans aach of the four major daaeee 
of task. 
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Table 3- 1 : Histogram of Obtainable Manipulator Configuration Distributions 
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4. Summary 

An RMMS w daacritad In toto ptpw can provfcfe a wMu rang* o4 manipulator porformanca horn » tonoll amount ol hardware. 
Thto la vary Important lor pa do rmtoQ MAa which tra aMhar poorly daAnad. ohan ona ton# occurrancaa, or to e Had in laola t ad 
amdrowma nt a. Examptoa ol auch ta*a Inciuda nuclaar plant malntananca and practically any typa ol manipuMton raqulrad 
Moard a apacacrafl In addtoon to too I n c r aaaa In capability attordad by an RMMS. toaro ara vartoua aide banal* to auch a 
modular daalgn which maM I vary attrac tom lor commardal uaa, moat notably too aaaa ol matotatotoQ, modifying. and updating 
auchaayatatn. 

too ara now buddtog a amaB RMMS prototypo. Aa ol January 1987, two ioint modulaa and lour to* modutoa hava boon 
machlnad. and ara now baing I n tagratad with a computar control ayatam. At too aama tona. toaoratfeal wo* la undar way to 
davatop Mgorlthma lor ganaratlng manipulator klnamatica, dynamlca, and control. A ma|or aflort la being made to rattoca tha alta 
told weight ol too currant Joint daalgn. which la a tonitlng factor In both manipulator ipoad and payload. Flrat operation o I too 
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Examples of typtcal class applications: 


Clast 

I - 

precision assembly, printed circuit component 
Insertion. 

Class 

II - 

small part handling, pick and place assembly, 
loading and unloading of machine tools 

Class 

III - 

large part handling, high contact force 
operations, surface grinding and deburrlng 

Class 

IV - 

seam tracking for welding or sealing, 
spray painting 


Teble 3*2: Manipulator T ask Classification 


system is expected in the second quarter of 1067. 
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i n t hi i p ayee - we l edil multiple coordinated robot arms /fey considering the ares (1) as 
clossd kinematic chains and f2) as a fores constrained mechanical systea working on the sane 
object simultaneously. In both formulations a new dynamic control nethod is discussed. It is 
based on a feedback linearization and simultaneous output decoupling technique. Applying a 
nonli ear feedback and a nonlinear coordinate transformation, the complicated model of the 
multiple robot arms in either formulation is converted into a linear and output decoupled 
system. The linear system control theory and optimal control theory are used to design robust 
controllers in the task space. The first formulation has the advantage of automatically 
handlinq the coordination and load distribution among the robot arms. In the second 
formulation, by choosing a general output equation we could superimpose the position and 
velocity error feedback with the force-torque error feedback in the task space simultaneously. 


2. Introduction 



The notion of "multiple robot arms** originates from two everyday scenarios. The first 
scenario is an authropomorphic one by noting that humans have two arms and hands and everyday 
manual work is normally performed by two-handed humans. In fact, manual activities and tasks 
are normally perceived and designed such that they assume two-handed humans; a one-handed 
person is a handicapped person from that point of view. Thus, in order to replace humans with 
robots to perform normal manual activities it seems natural to visualize and design robots 
with two arms and hands. The second scenario is an industrial one by noting that production 
lines in industry assume an organized distribution of manipulative activities along the 
production line that can be carried out by a distributed set of robot arms in a proper 
arrangement. 


Scenarios of multiple robot arms are also assumed and predicted for space applications in 
a natural way. Space station assembly, maintenance and servicing will require the in-site 
manual work of EVA astronauts in the initial operational configuration. This manual work also 
includes the simultaneous activities of two or more EVA astronauts in the handling or 
assembly of large structura* elements in space. Most satellite servicing and maintenance 
operations also assume two-handed manual work of EVA astronauts. Thus, the objective of 
decreasing EVA activities in Earth orbit by introducing and increasing robot activities there 
requires the consideration and the design of the control of multiple robot arms. 


The technically interesting and challengine problems in the control of multiple robot 
arms arise when (i) the work envelopes of two or more robot arms overlap and (ii) two or more 
robot arms simultaneously work on the same object in a presumably cooperative manner to 
perform a given task which cannot be performed by one arm only. 


The Control problem of two or multiple robot arms has been studied by many investigators 
[1-12]. Although the control problem of two or multiple arms is complex, some examples of 
applications, such as a two-arm lathe loader, a two-arm robot press loader/unloader, and two 
single-arm robots working together to handle stamping press loading and unloading, are given 
by Chimes [1]. In these applications, the problem is solved specifically. The system design 
is based on a solid understanding of the problem. 

Hemami and Wyman [2] investigated the problem of force control in closed chain dynamic 
systems. In their work, the dynamic system is linearized about an operating point and linear 
feedback is used to maintain the forces of constraints. The validity of the method is 
restricted to a rather small neighborhood of the operating point in which the dynamic system 
can be linearized, orin and Oh [3] considered the control of force distribution in robotic 
mechanisms containing closed kinematic chains. The problem of solving for the input joint 
torques from a given trajectory is underspecified. The linear programming has been used to 
obtain a solution which optimizes a weighted combination of energy consumption and load 
balancing. The dynamic equations of the mechanisms are excluded from the control method. The 
stability of the control algorithm is in no way ensured. Ishida [4] developed a force control 
technique which uses a wrist force sensor to measure the interactive force between two arms. 
The parallel transfer task and the rotational transfer task are .considered only. The control 
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algorithm is dsrivsd for both mastsr/slavs soda and indiatinguiahad mode (tha saaa status 
■ode). Pujii and Kurono [5] proposad tha mathod of virtual rafaranca. This method consists 
of tha idantification of tha joint control aoda raquirad to parfora a dasirad Cartaaian 
motion. Tha control loop at aach joint usas only position feedback and no coapansation for 
tha coupling between joints. 

Alford and Balyau [6] hava dasignad a hlararchical computer control structure for two 
PUMA robot arms operating in a aastar/slava aoda. Tha proposad coordinated control system has 
joint position predictors , a coordinate transforaation, and a slave coaaand aodifiar. An 
explicit control algor itha is derived and tasted/ iaplaaantad for an experimental path: a 
straight line in tha vertical direction. However, tha question on how to define the 
prediction function, tha transformation, and tha modification function is left open in tha 
paper, and tha dynamics of tha arms is excluded from tha algorithm. 

Whan two robot arms work on an object certain constraints must be satisfied in order to 
carry out a smooth, coordinated operation. Zheng and Luh [7] have derived a sat of holonomic 
constraints on positions and orientations of tha and effectors for two robots in three 
specific working conditions, namely, handling a rigid-body object, handling a pair of pliers, 
and handling an object having a spherical joint. The result is extended to the constraints 
between joint velocities and accelerations of tha two robots for tha three above mentioned 
cases [8]. 

Considering tasks of transferring an object by holding it with two robot arms, Lim and 
Chyung [9] introduced a position control method using kinematic relations between the object 
and tha two robot arms. By first specifying tha trajectory of the object, the differential 
changes of each robot hand are computed from the differential changes of the planned path. 

The commands or differential changes of each joint of the two robot arms are generated by 
applying the inverse Jacobian matrix. The method is simple but applicable only when tha 
involved motion is very slow, /round and Hoyer [10-12] proposed a hierarchical control method 
for collision avoidance in multi-robot systems. The method adopts a hierarchical coordinator 
and is systematic. However, an algorithm is needed to design the couplings among robots. 
Vukobratovic and Potkonjak [13] described a method which can be used to obtain the closed 
chain dynamics of two coordinated robot arms. However, the reaction force and reaction moment 
between the two arms are retained in the final equations. Hayati [20] extended the idea of 
hybrid posit ion/ force control to the multi-arm case. Based on equations of motion for a 
multi-arm system, which are derived in a constrained coordinate frame located at the grasped 
object, a controller is designed to cooperate n robot arms such that the load is shared among 
the arms in a non-conflicting way. A minimization of the magnitude of forces and torques is 
performed to decide how much each robot arm should contribute. It appears that the existing 
coordinated control methods fall in lack of either systematic synthesis of the control system 
or full consideration of robot arm dynamics. 

In this paper we concentrate on the application of nonlinear feedback to the control of 
multiple robot arms. Previously we derived a general algorithm for the control of a single 
rigid robot arm through nonlinear feedback and state transformation resulting exact system 
linearization and simultaneous output decoupling [15,16]. Our control design technique 
elevates the robot arm servo problem from the joint space to the task space with three 
important consequences, (i) On the joint level our scheme computes and commands drive forces 
or torques on their actuator-equivalent quantities (current, voltage, pressure), (ii) The 
robot arm system in the task space is considered as a linear system, and the powerful tools of 
linear control theory, including optimal control, are applicable to robot arm controller 
design in the task space. (iii) Our controller can directly respond to task space commands 
provided that these commands are formulated in form of closed time functions. The question 
discussed in this paper is: how can our control method be applied to the control of multiple 
robot arms. 

We are discussing two modeling approaches. In tho first approach, we model the multiple 
arm system as a single system, that is, as a closed loop kinematic chain. In the second 
approach we retain the single arm models, but we introduce task constraints and force-moment 
measurements in the control scheme. The paper concludes with a brief discussion of 
computational architectures that are needed to implement our control technique for the control 
of multiple robot arms. 

3 . Closed. Chain Formulation 

As the first approach to coordinated control of multiple robot arms, we consider the 
multiple robot arms as a single mechanical system consisting of kinematic closed chains. For 
tasks of lifting a heavy workpiece using robot arms, two or more robots are required if the 
workpiece is out of loading limit of any available robot arm. Suppose that m robot arms are 
used in such a task and that they all grasp on the same object (workpiece) in order to lift 
it, turn it, etc. Our primary concern is to obtain a dynamic model of these robots for the 
control purpose. Since they grasp on the same object, the dynamic behavior of one robot is 
not independent of the dynamic behavior of the other robots any more. A unity of mechnical 
system is rather formed by the robot arms involved and by the grasped object. 

We will derive the Lagrange's equations of motion for this mechnical system. Those 
equations will serve as a model of the system to design control algorithms. For the m robots 
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of consideration, vs nans them robot X, robot 2, and robot m , respectively. Wo assume 

that robot i has n A links. Wo also assuse that each robot firsly grasps the object so that 

there is no movement between its end effector and the object. Closed chains are formed in 
such a configuration by the m robot arms, the object, and the ground. Wotiee that the object 
and the last links of the robot arms become a single link. From the Kutzbach-Grubler 
criterion [17], the degrees of freedom of a spatial linkage structure connected by joints with 
each joint possessing one degree of freedom are given as follows 

p - 6<i-l) - 5j U) 

where i is the number of links and j is the number of joints. This formula reflects the fact 
that each moving link has six degrees of freedom and the fixed link (the ground) has none, and 
that each joint of one degree of freedom causes a loss of five degrees of freedom for a link. 
For our case of m robots, the degrees of freedom of this entire mechanical system is than 


m am 

p - «[ z (i^-l) + 1] - 5 l !U - t n k - «m + 6 
k-1 * k-1 * k-1 


U) 


where is the number of links of robot k. If three robot arms are involved to perform a 

task, Table 1 shows 10 different combinations of three robot arms with five, six or seven 
degrees of freedom. 

Before proceeding, let us define some notations that will be used in the rest of this 
section. 


q 1 - cej ej ... ej ]• 

0 - [(0 1 )' (0 2 )' ... (0")']' 
<j - tq x q 2 ••• qp] ' 

T - [t x t 2 ... T p ]’ 

f 1 - C r{ rj ... f L n y 
F - [ (F 1 ) ' (F 2 ) ' ... (F*) ']' 


: joint variables of robot i 

: joint variables of the mechanical system 
: generalized coordinates 

: generalized forces corresponding to q 
: joint force/ torque of robot i 

: joint force/torque of the mechanical system 


n - + n 2 + ... + . 

The generalized coordinates q can be chosen arbitrarily as long as they are linearly 
independent of each other. They are functionally related to the joint variables 0. We denote 
the relation by 


q - Q(9) . (3) 

Knowing the generalized coordinates q, the configuration of the mechanical system, thus the 
joint variable 9, is uniquely determined. We denote such inverse relation by 


9 - 0(q) 


(4) 


with the above notations, the Lagrange's equations of motion for the mechanical system are 
described by 


, ia_ ,• , ill 22 a + ill 

( J ( .2 3 q q + .2 

x 36 36 


3 2 0, 


4 . 


■>q 


3 2 0 


aq 

i - 1, 2, 




3q 


3930 


30 . • . 

D<J i 


( 5 ) 


where L is the Lagrangian of the whole mechanical system. Equation (5) is a generalization of 
the equations of motion of two robot arms presented in [14). 
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w* *saign • ooordinat* (rut to each link of ovory robot ora. Wo looat* a world 
coordinate froao in tho nn—nn work apaeo of tho ■ robot*. Zn tbo prooooo of axpraaaing ttao 
klnotlo and potantlal anargle* of tb* aoebanloal ayataa, w* divide tb* aaaa of tb* object Into 
a part*. Uah robot la roaponalbla' for on* part of tba object aaaa by adding It to tb* aaaa 
of tb* laat link. After carrying out tb* derivation* of tb* Lagranglan function, v* obtain, 
tb* dynaalo aquation* of tb* mechanical ayataa 

D(q)q ♦ I(q, 4 ) ♦ O(q) - T («) 


wber* 


0 (q) - D (e(q)) J Q 

j ■ -ifi- 
0 aq 


D(0) 
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o* J 


D r - (D^j ) la the inertia aatrix of robot r 
n r 3TS 


'ij 


Z Trace ( — ®r“ IJ. 
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E i " is the coaffici * nt of centripetal (j-k) or Coriolis (jjOc) force of robot r 
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3Tf 


Trace ( 


r 3 (^)’ 
x; — ! — ) 


E ^ lc " *-aax( 1, j,k) ' 30^30^ "* 30* 
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is the gravity force of robot r 



In the abova dafinitiona, T* - A* 2 ... A^ i-1)i# where A^ ia tha Denavit-Hartenberg 

homogeneous transformation matrix from coordinata frama i to coordinata frama j of robot r? * 

ia tha mass of link i of robot r; r£ la tha aaaa cantar of link i of robot r; ia tha 

• pseudoinertia matrix of link i of robot rj g ia tha accalaration of gravity, dafinad to ba a 
4x1 column vactor with tha last componant baing aqual to zero. 

Equation (6) charactarizaa tha dynamic bahavior of tha whole machanical ayatam. However, 
this aquation ia nonlinaar, couplad, and complicatad. It poaaa graat difficulty in controllar 
daaigna. Wa propoaa to linaari za and output dacoupla tha ayatam (6) using a nonlinaar 
faadback and a nonlinaar coordinata transformation. Lat ua introduca a atata spaca variabla x 
by sotting 


x i " q i 1 x i+p * 4 i * 1-1 * 


x 1 - tx 1 * 2 ••• x D r , x 3 



Tha dynamic aquation (6) can ba vrittan as 



X 3 


0 

* - 

-D* 1 (x 1 )[E(x 1 ,x 3 )+G(x 1 )] _ 

+ 

D -1 (X 1 ) J' 

L 0 J 


- r(x) * g(x)F 


2 , 


- t*, 


p+i 


X 2pJ 


(7) 


Ho taka tha position (orientation) of the object handled as the ayatam output 

y - Mx 1 ) - [h x (x 1 ) hjfx 1 ) ... h^x 1 )]*. (8) 

For tha nonlinaar faadback, the so-called decoupling matrix ia [15,16] 

A(X) - Jvfx 1 ) D~ 1 (X 1 ) J' 

where is the Jacobian, matrix of h. The nonlinear feedback has tha form 
F - *(x) + a(x) u 

where i(x) and i(x) ara determined from the following two algebraic equations [15,16] 


A(X) 

Mx) - -L 3 h 

C9) 

A(x) 

3(X) - Y. 

(10) 
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m* 
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mm: 




Xn the above equations, tjh la the second ordar Lia darivativa of h along f , 

»* •. * 


Y - 


o 


L 


O' 


■ [1 i • • • 1] ia a lxa^ naw vactor with all antriaa aqual to 1 and i-1, . .. p, ara 

n 


ehoaan auoh that > 0 and m j + » a + • . . + 


The indax m^ ia aaaooiatad with tha fact 


that a total number of n indapandant aotuatora (inputa) ara to ba dividad into p groups to 
oontrol p outputs. Tha raquirad nonlinaar ooordianta transformation is givan by [13,16] 

♦(x) - [h x L f h x ... h p L £ h p ]* • 

Sinca both aquations (9) and (10) ara underdetermined, thara ara infinita many solutions for 
them. Any solution sarvas tha purposa of linaariaation and daooupling provided that B(x) is 
invertible. A solution to aquation (9) is givan by [18] 


0<x) - - A + (x) L*h(x) 


( 11 ) 


wh.r. A + - A'(AA')" 1 1. th. 9*n.raliz.d lnv.ra. of A(x). Th. gan.ral solution to aquation 
(10) ia [18] 


8(x) • A + (x) r + (I - A + A) H 

vhara K ia an arbitrary matrix which ia to ba chosan to asks 8(x) invartibla. 


( 12 ) 


Aftar applying tha nonlinaar feedback and tha nonlinaar coordlnata transformation, tha 
original system (7) with output (8) ia convartad into tha following linaar and dacouplad 

system 


t « Az + Bu 
y - Cz 


(13a) 

(13b) 
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Note that tha obtainad linaar systam (13) consists of p indapandant subsystems. Tha control 
problam of tha vhola mechanical systam is than simplified to a design problem of individual 
subsystems. Tha ith subsystem is defined by 
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(14a) 


V i - Cl 0] 


*2i-l 

*21 


» i * 1| • • • / P 


(14b) 
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where u* is ths ith group input with m^ components. To stabilise ths subsystsm (14 ) , wo 
introduce a oonatant feedback u* ■ - «* + v* with 


where a 1, - t« 2 i. x » 2i ) ' , and v* is ths nsw reference input. With suoh a constant feedback, 


subsystsa (14) bseoass 


- k ii "*ia 


][::i • t:j- 


y x - (i 


01 [ ’T] 


i *■ 1# •••» P# 


or in compact fora 


l 1 - Z i + B a V 1 


Y i - C 1 * 


whsrs A x can bs easily identified froa aquation (15a) . For ths above systea (IS), the damping 
ratio F , and the natural frequency <*> n are related with the feedback gains by 


°n “ *il 


3 5 “n " lt i2* 


We now consider equation (15) as the new mathematical nodal of the real systea which is 
exactly linearized, output decoupled and stabilized. The desired (nominal) input to each 
subsystem can be derived froa the following system 


i "if” z 2i»i r ° "i i 

31 1 (v 1 ) 

■ k i2 J L * 2i J L 


y“ - [i o]j^ d J , i - i, p (i6b) 

where the superscript "&* indicates "desired" quantities. From equation (16), the desired 
input can be obtained in terms of the desired task space trajectory. 

Y i (vV « vi + k i 2 + *ii y? * i - 1, p. ( 1 7 ) 

It is observed that the left hand side of equation (17) is the sum of m x inputs in task space 

computed from the planned trajectory. For a given planned trajectory, at any instant time the 
right hand side of equation (17) is a given value. Applying the generalized inverse, we 
obtain [18] 

(v 1 )* - y ± ( y yJ)’ 1 (y x + k i2 + k n y x ) . (1*) 

Hots that in our control dssign msthodology ths actual control vsetor is ths task spacs 
command as formulatsd by aquation (17). On the joint level, our methodology computes drive 
forces or torques for the individual actuators, and the servo design is on the task level. 

Let the output error be defined as follows: 
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•m-[ 


y 4 -yj 

*l-*l 


vhura and ^ * r * tha raal (aaaaurad) valuaa, and y£ and fr* *** tha daairad valuaa. 
aliainata tha output arror a^, va utllisa an optlaal arror oorr acting control loop by 
■iniuiilng tha following ooat functional 


«o 


( T 1 1 

J - \ [( AvV R + a 1 (t)' Q a 1 (t) J dt + ^(T)'* a 1 (T). 


Tha optlnal corraotlon la given by 
Ay 1 - -R _1 B[ P(t) a A (t) 


(19) 


where P(t) is a positive definite solution of tha Riccatl aquation 


P(t) - -P K t) - Xj P(t) + p(t) B l R _1 B| P(t) - Q 

r<t) - s. 


with 


x i 




Tha ovarall atructura of tha controllar daaiqn ia dapictad in Figure 1* 

4 . gflica Control Approach 

In this approach, va conaidar tha dynaaics of aach robot separately, but va poaa 
constraints on tha dynaaic aquations by introducing tha intaractiva forca and intaractiva 
sonant aaong tha robot arms. 

Wa hava proper ad a forca control approach to tha coordination of tvo robot araa 
parfornlng a singla task [19}. Tha coordination batvaan tvo robot araa ia achiavad by 
monitoring tha intaractiva forca and aoaant at tha and effectors. Nov va axtand this aathod 
to aulti-ara casa. 

Suppoaa that a robot araa (a > 2) ara working on an object, e.g., lifting or turning a 
heavy workpiece. Tha problaa va ara dealing vith is to find a control algor itha for a robots 
such that tha task is performed in a coordinated fashion. Wa aaauaa that aach robot has a 
forca (torque) sensor installed at its and effector. Using forca control approach, tha 
coordination aaong a robot arms is realised by regulating tha forca and aoaant applied to t. 
object by aach robot, with tha aid of proper task planning, a robot araa ara able to move 
a non-conflicting way. 

Tha dynaaic aquations of a systaa of a robot aras ara given as follows i 

D^q 1 ) q 1 + E^q 1 , g A ) ♦ J^q 1 ) F 1 - t 1 , i - 1, 2, ..., a 


where q* ia an n^-diaansional joint variable vector of robot 1, n^ is tha degrees of freedoa 
of robot i, F* is an n^-dlmensional vector of the forca and aoaant aeasureaento of robot i, 
t* ia an n^-diaensional joint torque (forca) vector of robot i, and is tha Jacobian aatrix 


of robot i. 

Nov va introduce a state variable x by letting 
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in )' - q l . 
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** - f*vi x "i*«a r ' ••• 

** - t*v- + vi +l X " J ‘ " cq i ••• “ •*"' 

** " t* n+1 • •• * n + ni i' ■ t$i ••• ^ 

*^ 8 " t*n + n l+ l *n* V n 3 l* ‘ Wj ... 4^1* - 4* 


* 2 * “ ^ x n+n l +. . • + n > _ 1 +l ••• *an 3 ' " t4 l *** ^ ' 


whara n - n^+ + ... + n^. Than x is a 2n-diasnsional vector partitionsd into 2a blocks 


x - [x x x a ... x n x n+1 ... x 2n ]‘ - 


r x 1 -» 


x" 

x- +1 

2m 


x * [x^ • • • x n ] 

with ths first a blocks (corrasponding to ths first n coaponsnts x) raprasanting ths joint 
positions of a robots and with ths last a blocks rsprsssnting ths joint valocitiss of a 
robots . 

Ths dynaaic aquations of a robots can now ba writtan in tsras of stats variabla x as 
follows: 
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or * - f(x) + g(x)r (20) 

whsrs f ang g can bs aasily idsntifisd from ths abova aquation. Wa taka ths output aquations 
of ths fora 


187 




r h 1 - 


p 1 ♦ wi r l “| 

P * 


h a 


nj P 2 ♦ wj H 

y - h(xj - 
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_ h* . 


_ P* + F* . 


where Mp, i - 1, . .., m, are the weighting matrices, and p A is tha position and 

orientation vector of robot i in the world coordinate frame. The dimension of output vector y 
is n. 


Equation (20) represents a nonlinear and coupled system with output (21). Using e 
nonlinear feedback t ■ a(x) + 0(x)u and a nonlinear coordinate transformation T(x), we are 

able to linearise and output decouple the system (20). The a(x) and 0(x) in the nonlinear 
feedback are given by 


a(x) - -A -1 (x) tj h 


(22) 

8(x) - A" l (x) 


(23) 

A(X) - g* Q . 

0 

e 


The nonlinear transformation is given by 


T(x) - 


L r h i 


L f h n 


(24) 


Application of the nonlinear feedback and the nonlinear coordinate transformation converts the 
system (20) with the output (21) into the following linear and decoupled system 


i - Az + Bu 
y - Cz 

where 

z • [z x ... z 2n ] • , u « [u x ... u n J» , 


A - 

i 

o * 

y ‘ o 

----- 1 

9 B - 


u n j 


A i-[ 

-° i -I 

_ 0 0 J 

• ■»-[:] 


(25a) 

(25b) 


y - (y x 

•** y n J ' 

9 

0 

• 

a 

o 

Cl . 0 

e 

B n _ 


° -cj 


C L m (l o] 9 i"li 2 , . n. 
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Hot® that ay stas (25) consists of n indapandant subsystems. Likewise as in tha closed chain 
formulation, for aach subsystaa va can design a constunt faadback to stabiliza it and dssigrn 
an optimal arror-corracting loop to alininats tha output errors. Tha ovarall control lar 
structura is shown in rigura 2. 

5. Canglmion 

Our approachas to tha control problaa of multipla robot arms ara motivatad by tha dasira 
of making rigorous usa of tha dynamics of robot arms involvad in tha task. Tha closad chain 
approach is initiatad from the fact that tha dynamic bahaviors of tha robot arms ara not 
indapandant of aach othar any mora if thay grasp on a common object. In this approach, tha 
multipla robot arms ara sodalad as a singla machanical system by choosing a sat of ganaralisad 
coordinates whose number equals tha number of degrees of freedom of tha whole system, rigura 
1 shows tha schematic structura of tha controller for tha closad chain approach as implemented 
on computers, rrom tha initial physical task, tha task planning of tha upper left block in 
Figure 1 produces a trajectory in tha task space expressed as a smooth function of time. Tha 
command generator block realizes aquation (18) and yields tha desired reference input. Tha 
lower left block is tha implementation of tha optimal error correction described by aquation 
(19). It takas tha task space error as its input, and produces tha optimal correction as its 
output. Tha Q( 6) block to tha right of tha multipla robot arms establishes tha ganaralisad 
coordinates as wall as their time derivatives from tha measured joint positions and velocities 
of tha robot arms. Tha bulk of tha controller is the nonlinear faadback block which computes 

tha joint driving torques or forces. Because tha dynamic projection functions D^, E^, and G* 

ara derived in terms of tha joint variables, it may be convenient to usa tha joint variables 
in addition to tha generalized coordinates for computing tha nonlinear feedback. 

Different from tha closad chain approach, tha force control approach assumes that aach 
robot arm has a force and moment sensor located at tha and effector. Tha force and moment 
measurements are introduced into the dynamic equations and output (task) equations. This is 

schematically depicted in Figure 2. The measurements F 1 , F 2 , ..., F® are transmitted to the 
nonlinear feedback block, the output h block, and the coordinate transformation T block. The 
three blocks to the left of the nonlinear feedback block in Figure 2 are structurally similar 
to those in Figure 1. 

Using the results from differential geometric system theory, we are able to linearize and 
to decouple the complicated dynamic equations of multiple robot arms including the object held 
by the arms. Independent of the approach being taken, we eventually deal with a linear and 
decoupled system. Thus we can have a unified design technique for coordinated control of 
multiple robot arms. 

It should be noted that both methods used in this paper are systematic and are robot arm 
independent. The most important feature is that the control algorithms are task independent, 
that is, there is no need to change the structure of the controller or even the parameters of 
the controller from task to task. As natural as would be, the change of tasks only causes the 
adjustment of the input command which is conveniently given in the task space rather than ii. 
the joint space. The two control methods can be used in slightly different situations. For 
example, if the robot arms are loosely connected through the object, the force control 
approach is preferable; if the robot arms are mechanically locked while transferring the 
object, the closed-chain approach is more likely a solution. 

Each control scheme naturally leads itself for computational implementation using 
distributed computing system, possibly in multi-bus architecture. Figures 1 and 2 provide a 
high level structure of computational implementation requirements. The details of the 
implementation require a deeper analysis. 
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!■ Ifcll HBf » |im»U U«kal<M/ifor ooatrol 1 lag aaltlplo aaaipalatora 
vUok are koldlag a iligli ek/i«t i>4 tkinfori fora a oloeed kiaaaatia oka la. 

Tka object, vklak aay or aay aot bo la ooataot vitk a rigid oavlroaaoat, Is 
aaaaaod to bo bold rigidly by a robot ead~offeetora. Tko derivatioa la baaod oa 
sottlag ap ooaatraiat oqaatioaa vklok rodaco tko dxa dogrooa of froodoa of a 
aaaipalatora oaok kavlag all joint*. Addltloaal ooaatraiat aqiftlaai art 
ooaaldortd vkea oao or aoro of tko dogrooa of froodoa of tko objoot la rodaood 
dao to oxteraal ooaatralata. Utlllalag tko oporatloaal apaoo dyaaaloa 
oqaatioaa, a doooapllag eoatrollor la "aaigaed to ooatrol botk tko poaltioa aad 
tko latoraotloa forooa of tko objoot vltk tko oavlroaaoat. Fliilly^-vaapffmi: 

•iaalatioa roaalta for tko ooatrol of a pair of tvo-liak aaalpalatora^v^c^ — 4 

1. INTRODUCTION 

Tko toplo of aaltlplo robot ooatrol ia relatively aov la robot lob re tear ok. Tko exteaaloa of robot ooatrol 
toekaiqaoa to tko oaao of aaltlplo aaaipalatora la aoooaaltatod by/real Itiee oaooaatorod botk for aaaipalatiag 
aaall objoota aad for kaadllag largo vorkpioeoa. Tko aaaipalatjroa of objoota aoraally roqalroa at loaat too 
kaada to a iaal taaooaaly poaltioa aad roorloat tko objoot ao tkat oltkor oao or botk kaada oaa porfora tkoir 
respective taaka. loro, tko oaployaoat of aa extra ara la aired aot by lialtatioaa oa foroo/ torqae oatpat of 
aa ara bat by tko foot tkat la ooaplex took exeoatioa, tko jdorfcpleee aaat froqaoatly bo roorloatod to aspoao tka 
kard-to-reaok areas. Ia foot, aoao experts have a agar's ted tkat robota vill aot traly bo latagratad ia tko 
fatara factory oavlroaaoat aatil aaltlplo robot tap* plaaalag aad ooatrol have raaokod a oortala level of 
aatarity. Ia tkla paper vo vill oaly address tko eeptrol problaa aaaoolatad vltk aaltlplo aaaipalatora kaadllag 
a ooaaoaly kold objoot. Otbor probloaa, aaok ae/deveral aaaipalatora vorklag ia tko aaao vorkooll vltkoat 
aaaipalatiag tko aaao objoota aiaal taaooaaly, or/two area vorklag oa tko aaao objoota bat kav lag relative aotloa 
vltk roapoot to tko vorkpiooo, are laportaat rpdearek toploa aot addreaaod ia tkla paper. 

Barly vork la tkla area (1, 2) kaa aodtly dealt vltk aaator/alavo ooatrol tookaiqaoa. Tkla approaok 1a 
coaeeptaally aiapla. Oao aaalpalator \jr poaltioa aorvood vkllo tko otkar aaaipalator la foroo aorvood. Tko 
force aorvood era ia ooatrollod la oomrfiaat aodo to follov tko aaator (poaltioa aorvood) ara. Addltloaal food 
forward forooa (torqaoa) are added do tko foroo ooatrollod ara to roaliao tko roqairod latoraotloa forooa 
botvooa tko trai aad tko objoot vklok ia bo lag aaalpalatod. loro roooatiy, aaoag otkere. Lak aad Zkoag kavo 
reported tkoir vork la daal ara opardiaatioa. Vo will aoatioa oaly aoao of tkoir vork kero. Ia roforoaoo 13 J 
tkoy diaoaaa tko kiaoaatio Isaacs related to tko ooordlaatioa of daal aaaipalator robota. Tkoir approaok la 
baaod oa poaltioa aorvoiag of/oae of tko araa aad tko oao of kiaoaatio ooaatraiat oqaatioaa to aodify tko 
trajectory of tko eeeoad ara to roaliao oooporatioa botvooa tko araa. Ia roforoaoo (4j tkoy extead tkoir 
oaalyaia to iaolado tko eoadtraiat oqaatioaa for tko Joiat aceelerat ioa. Tbea, aaiag tko ara Jaooblaa. jolat 
torqaoa are eoapated to dqdivo tko araa. 

Ia tkla paper vo derive tko oqaatioaa of aotioa la tko ao-eailod Oporatloaal Spaoo (or Cartoaiaa atato 
apaoo) (Si. Vo aaaaao/a goaoral oaao of a eooperatlag robota vklok are koldiag aa objoot rigidly. Tkla object 
aay alao bo coaatraided froa aotloa la oao or aoro diaoaaloaa by aa oxtaraal oavlroaaoat. Bqaatloas of aotioa 
are derived aaiag tko Lagraago aaltipllor taokaiqao. Koforoaooa [41 aad r?) provide oxoolloat backgroaad for 
tkla foraala t loa/ It ia aaaaaod tkat oaok aaaipalator ia oqaippod vitk a foroo/torqao aoaaor eapablo of 
aoaaariag tkroo /rtkogoaal forooa aad torqaoa ia a glvoa eoordiaato fraao. Tko ala ia to ooatrol tko poaltioa 
of tko objoot add ita latoraotloa forooa vitk tko oavlroaaoat ia tko eeaee of hybrid ooatrol (tl. Tkla paper 
oxtoada tko prdvioaaly reported roaalta ia (9) by a aoro ooapaot foraalatioa aad by explicitly eoepatiag tko 
iatoraotioa forooa botvooa tko robota. Fiaally. a alaalatioa atady for a pair of tvo-liak eooperatlag 
aaaipalatora' ia proaoatod to validate tko aaalyaia. 

/ 
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monies or multi fl* cooruATma bosot iunifuutoss 


tm Uti MitiH, tt vill tki 4;n«i«i miiIUii far a «!om 4 Uiiaitii aha la forasd by a 

aniriliUfi rl|Uly intyiii u objiit. !• vill iniai that tosh aaaipalator has ais joiats aad that aoaa of 
tha aaalpalators axpariaaaa slagalarlty (i.a.. dogaaarata aaalpalatoi t Jaaobiaa) aa Uty favfora a task. Tha 
inifii ahjaat uy ha la aaataat with a rigid aaviroaaaat. 

la dariviag tha ifuttaai of aat ioa, va vill aaaaaa that thara aaiata a aoordiaata fraaa vhiah ia sttaahad 
ta tha ahjaat. Ia fait, ia tha matt saatloa, va vill aaa thia aoordiaata iraaa ta iHiify tha doairad aatiaa/ 
forao af tha ahjaat. Figaro 2.1 thavt a aahaaatia dravlag af thia aaltipla aaaipalator ayataa. 



Figaro 2.1. 

Tha darlvatloas of tha aqaatioas of aotioa ara ooaaldarably simplified vhaa i) va vaa tha Cartaaiam atata 
dyaaaios aqaatioas [3.10], aid h) va lmap tha ohjaat aass/iaartla imto that of tha aisth llmk of tha araa. 

Lot aa bagia vith tha vail kaova aqaatioa of aotioa for a aiagla aalti-liak ara [9,10] 

M(g) 1 ♦ X<g.g> ♦ 0(g) - t «•!> 

vhara g C t A daaotaa tha joiat atata vsrlabla. M(g)Gk“ a la tha iaartia aatris vhiah ia ayaastrla aad positive 
dafiaita, y(g,g)C* A ia tha aaatrifagal aad Corlolia foraa/torqaa vaotor, fl(g)€k* it tha gravity vector. aad 
xGt B la thajotat foraa/torqaa vaotor. Ia ordar to siaplify tha vordiag af tha papar, va vill aaaaaa that all 
tha joiata ara ravolate aad haaoa aaa tha tara "torqae" vhaa rafarriag to gaaaraliaad Joiat foraa/torqaa 
veetora. Tha ahova aqaatioa appliaa oaly to idaalisad friotioalaaa rigid araa. 

Ia tha first part of tha derlvatioa. va vill ooasidar aa object-fixed aoordiaata frame ralativa to vhioh 
tha dyaaaies aqaatioas vill ha vrittaa. fa will also aaaaaa that tha ohjaat haa baaa partitioaad iato a 
parts. Baah of thasa parts vill thaa ha ooasidarad aa a part of tha last llak of aaoh ara. Figaro 2.2 
illastrataa oae of thasa parts togathsr vith tha last llak of ara l. 


/ 


/ 

\ 

W*M, 



Flgara 2.2. Schematic draviag of tha partitioaad load aad its 
iatagratioa vith tha last llak of ara 1. 

If *J| is tha psaado- iaartia aatris of tha last llak raprasaatad la tha 1 ooordlaata fraaa. its rspresaata- 
tloa la tha B fraaa is ohtaiaad froa: 




last llak 


•li V -i 


( 2 . 2 ) 


Siaoa tha aras ara ooasidarad to ha rigidly attaokad to tha load, thara is a ooaataat traasforaatioa ralatiag 
tha i aad B fraaaa, |t. It thaa follows that 


- 


i - r f* ‘Xi) 1 - ?T 4 t 

J last llak 


(2.3) 


vhara E J i is tha pseado-iaar t la aatris of tha lost llak of tha ith ara asprassad ia tha E fraaa. Lat as aov 
assaaa that is tha psaado- iaartia aatris of tha ith partitioa of tha load. This aeaps that 
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T 

t 

i 


(2.4) 


h 


«“ J 


IMtitiM 1 




wkiri fiftlftin i U ii ilivi MkiiitUiUy Im Fi|tn 2.2. Util It lag i(iittiii (2.9) ia4 (2.4), »• m mow 
4 i(1h a mow laat llmk for tki Ilk ua wkiok la ossoatiallp tki ooakiaatloa of tki origiaal last limk plat a 
port lorn of tko loaf. Alto moto tkat tko ooordiaato fraao of tko laat llmk la mow B iastoad of tk# fraao L 


Althoagk Im prlmolplo oma aaa ot# Jolmt tpaoo dpaoaloa oqaatloaa for tko apatoa of amltlplo maalpalatora. 
tko dorlvotiom la tomtidoraklp tiaplifiod wkoa tko sw-oallo4 oporatlomal tpaoo foramlatiom (or Car tot lam atato 
tpaoo dpmaaiot) it mao 4. Tko rolatlomaklp kotwoom tko Joimt tpaoo am4 tko Cartoaiam tpaoo foramlatiom oam ko 
dorirod aiaplp ky at ills imp tko mamlpmlotor laookiaa at [5,10] : 

i - 1(A) A <>.5) 


wkoro 1 it tko aamipmlatwr laooklam am4 j it tko poaltlom/orlomtatiom of tko loot limk oowrdimmta fraao. Tko 
laookiam ato4 koto It a g amoral omo, a oam imp tkat omo oam oomaldor it to ko 4oflmo4 rolatiro to tko laat fraao, 
tko worl4 fraao, or im foot amp otkor 4oairo4 fraao. Tkklag tko tlao dor laat loo of (2.5) roamlta im 

1 - 1(A) ft ♦ 1(A) 1 <2.4> 

Bmkotitmtimg 1 froa (2.4) lato (2.1), wo oktalm 

«(,) r^a) (X - ia) * Ka.a) ♦ fl(a) - s 

or 

“*<A> X ♦ I s <ft,£) ♦ fl,<A> “ t (2.D 

wkoro 

*,(*> " rT <2> *<i) rl <a) (2.D 

Y,<a.a> - J~H Ka.i) - *<a> J -1 **) i<a> a) <»•»> 

fl«(a> - J" T (a> 9 <a> (2,io> 

la oqaatloa (2.7), £ roproaoatt am oqairaloat goaorallstd foroo wootor appliod at tko oad-poiat (B-frtao). Lot 
as doaoto kp 1 tko latoraotioa goaoralisod foroo wootor oa oao of tko robot arat at tko B-fraao. Tko oqaatloaa 
of aotloa aro tkoroforo 


*ii<Ai> 1 ♦ X.i<Ai»fti) ♦ fl*i<ftl) - Ei + Xi . 1 - 1 » <2.11) 

Slaoo Xi' s th# lmtormal forooa oroatod at poimt B. tkoir tu la soro. 


a 

i Xi - o 
i-i 

Tko oqaotioa of aotioa, tkoroforo, a ap bo obtalaod bp oiaplp add lag tko oqaatloma (2.11) 

a a a a 

t 2 N, A (A ft )] X ♦ I Y.i^i.fti) ♦ I flsi<Ai) • I Ei 
1-1 1-1 l-l 1 1-1 

la ordor to aiaplifp tko aotatloa, lot as doflao 


( 2 . 12 ) 


*.<9> - j, "«l<*i> 

- Jj Y,i<4i.ii) 

9»<9> - j, > 

* • ill *» 

wkoro 

9 ■ ( *F. flj «I> 

Now. .qutio. (2.12) e» b. vritt.a i. tk* tiapl.r fora 

“,(9> X ♦ Y,<9.9) ♦ fl,(9> - E * 


(2.13) 

(2.14) 
(2.13) 


( 2.10 


* 
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Net* that N. it Mill a te.Ml.e d.fieit. aatrla. Tfc# Lt.raetlee |.a.r*U..d ferae*, l.e. Ij’e. *• 

•Miaul by aelelag fer 1 la eqaatlea (2.10 aa4 .mb.titati.f tk* reaalta la (2.11) 

Xi - n,i(«i> « - is<«.a> - s«wi 

♦ *«i<ai*ii> ♦ fl.i<*i> - Ei . » - » ■ «•«> 

Zm luairy, if tki iBiUldul JoUt torqaoa are |ltu, om on oospito tki otiUiloit Ei ## ^ atiliaiag (ko 
Jaeobiaa aatrloas. Bq aotioa (2.10 on that ko mi to a© loo tko forward dymasioa of the aaltiple ara ay a too* 
Finally, tko iattfootioo fore# a eaa ko eoapmted froa oqaatioa (2.17). Implicit ia tko above derivation art tko 
ooaatraiat oqaatioaa 

I * *i<Ii> • *2<«2 ) * *n<*> 

vkoro H 4 ia tko forward kiaoaatiea rolatioa for am i. 


s. tmomics of nultiiu ooofbxatino wtmcrr uMmumm * with sxtbxnal oonstkaints 

la tkia aootioa vo will derive tko oqaatioaa of aotioa vkoo tko object tkat ia boiag kol4 by a robot 
■aaipalatora ia ia ooataot vitk a friotioaloaa rigid eaviroaaeat. Lot C bo a ooordiaato fraaio oa tko objoot to 
roproaoat tko goaoraltso4 ooaatraiat forooa, Ilia ooordiaato fraao, ia general, will bo 4iatiaot froa tko B- 
frame. Witk a sligkt aodlf ioatioa. oqaatioa (2.14) oaa bo traaaforaod ao tkat tko gar rallied foroo vector l 
jo aa bo ropreaoated ia tko C fraao. If $T ia a koaogoaooaa traaaforaatioa tkat relat«a tko C fraao to tko B 
fraao, tkoa gj eaa bo obtaiao4 to rolato tko goaoraliso4 foroo veotora ia tkoao two fraaoa. Tko Jaeobiaa aatria 
§J ia given froa 

c* B> oORO x c* 

- (3.1) 

• &» J 


T . 


Ia goaoral, aiaoo tko C-fraao oaa bo tiao /arylng, and ooaaeqaently ^7 aro ala 


x doaoto tko poaitioa/orioatatloa of tko C-fraao. Tkoa 


S, ■ ? * ♦ B 1 * (l - 4) 

aad banco oqaatioa (2.16) ia 

«„<9> 31. ♦ Y„<9.9> ♦ 9 ie <9> - E, <3. 3) 

wb.r. M.., V 0.. era d.fla.d by . ..t of a.trix .qa.tioa. <?.»> - (2.10) with I r.p l.o.d by £/ *.d 

2 0 *C XC d 

E. * <§J) T E- 

Lot aa oasaao that i(| fl )Cl B , a X 6, roprosoata tko ooaatraiat foaotioa amok tkat 

q(j c ) - 0 V t 2 0 (3.6) 

If vo doaoto by £ 0 tko goaoral ixod coaatraiat forco vector, tkoa tko oqaatioa of aotioa la 

»,,<9> 2e ♦ Y,.<9.9> ♦ fize<9> ■ E« ♦ I. (}•!) 

Tko ooaatraiat forco f 0 oaa bo obtaiaod by noting tkat tko virtaal vork porforaod by X 0 it aqoal to soro 
aad by aaiag tko Lagraago anltiplior aotkod aa diacaaaod ia (61 aad (7). 


Tko atatoaont of virtaal vork ia 
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Proa tho ooi 


L«tioi (3.1) 


*#(*,) 


•l t - 0 


(3.9) 


Vo oao now net tho Lograage multiplier XC»" to relate (3.1) and (3.9) at 

_ _ **<*•> 

tij - X T 1 »» 0 - 0 

*1. 

Siaot ia aa arbitrary laflaitaalaal displacement vector, It la mot equal to soro. Therefore 
I, - D t <I 0 > X (1.10) 


vboro 

°<2') - **(«„> / 6l 0 (3.U) 

To f lad X. vo differentiate (3.0 with reepeet to tlao tvlao 

i(l 0 ) • D(i 0 ) J 0 - 0 (3.12) 

i(l^) - i(l*) i 0 ♦ D(I 0 ) x fl ■ o (3.13) 

amd •mbatitmta for X« froa (3.7) 

®<j«> i. ♦ d(i.) i e, ♦ » T <*.> * - is. (fl.fi) - fl ss (8> i • o 

•kick «1... X .. 

X - ( D(» # ) iqj(fl) D T (I,) )-l ( D(j # ) iqj(fl) IX M (fl.fl) ♦ fl g0 (fl) - E,j - D(* # > I, ) 0.14) 

Equation (3.7) togatbar with aqaatloa (3.14) completely ebaraatarlxa tha aqaatloa of aotloa of 
oooparatlag robot with aataraal aotloa ooaatralata oa tba objaot. 


ltipl 


4. CONTROL OF MULTIPLE COOPERATING tOBOTS 

la thla aaetloa va vill introduce a ooatrol taohalqaa which 1a based oa the oparatioaal apaca foraalatloa 

to coatrol tha poeltioa of tha object and lataraotloa foroaa of tha objaot with the aataraal oawlroaaaat. Tha 

approach presented hare is baaed on distributing tha actuation forces among tha eras such that sack ira 
contributes to tha aotloa of tha object and to its own inertial forces ia a predetermined manner. Let us begin 

by introducing a decoupling general isad force saoh that 

Ec - -U,d ♦ *fd<X.d - io> ♦ *fp<Icd - i«>l ♦ »..<9> [Sod ♦ Sd ( *od- *c> * Sp ( *.d - 

♦ Y,«(fl.i) ♦ flx.tfl) <4.D 

where Xcd * 1 desired lataraotioa force rector between the object and the extsraal environment. 

Ve auat now show that the above oholca for £ 0 does result in tha nultiple robotic system following tha 
apeclfied trajectory and applying tha deairad forces on the constraint surface. Ia order to simplify the 
derivation and without loss of generality, let ua ssaame that the poeition and force aubapacas are reordered la 
the aquations such that the first 6-n elements of the position vector correspond to the position controlled 
subapace and the laat m elements belong to the force controlled subspace. This means that, for example. 
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-•p?]:r 

r o I ) *- 

-tdi- 


Lot ii partition U* mi Mtrii M 


, *ai *21. 


-c :i 

,ktn MjjCl 4 "* 1 fltk tho ibava doooapooltloo of tko foraa-poottlaa oabopoooo, tko 0 attri* (*•• oqutloa 

3.11) can ba vrlttii as 

D(j c ) -C0l l mm 1 (4,7) 


»<«,) - 0 


*Uk tko .boo. okoioo for p(i # ). »• «•» »o* ooapoto tko oloood loop dymoaloo of tko aoltlp.o aaaipolotor oyotoa 
wltk E oo tko goaorol Ixod ooatrol foroo vootor gloom by oqaotloa (4.1). Lot ao partttloa tko ostool foreo. 
•ooalara tioa, and valooity vcotors aa 


fioll flail f**l' 

• la " • *« “ . 

LiolJ ll. 2 j L i#J . 


oad oloo ••loot t tf . K td . Ipp 


r° ° 1 r° ° i k* °i k* °i 

l fd • • f fp “ * Sp “ • S« " } 

L° k fpJ L° k fdJ ° °J L 0 °J 

where k^'i art diagonal aatrioea. Fran aquation ( 4 . 1 ) E 0 ia glvan by 

r ° 1 r ° ° ^ *.d k pd *p I" Sp *pl i 

L^od J L k fd if. , k fp *f. -° . . 0 - L 0 J 

akoro If - Xid " iol 1,4 ip “ lid _ *ol 


♦ x„ ♦ a* 


The raactlon forca X 0 ba oonpatad by substituting fro* aquation (4.11) for f 0 In aquation (3.10) and (3.14). 
After soma nanipulaiio* vf aqua t Iona, ona obtains 
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3. EXAMPLE 


la order to validate tht propoaad theory, a simulation a tody via undertaken. Tha aodol eoaaittvd ot 
simple plaaar robotio system composed of a pair of tvo-lluk revolute aaalpal a tora. Tha objeot vaa aaavaad to 
a poiat ana attachad to tka mppar lftak of aaok of tka aaalpal a tora. ace aach aaaipalator kaa oaly t 
dagraaa of fraadoa. tha objact vaa aaaaaad to ba ia aoataet with tka appar link* oaly through iataraotioa fore 
ratkar tkaa throagh forcaa aad ' orques. 

Tao eaaaa vara coaaidarad. la tka flrat eaaa it vaa aaaaaad that tkara vara no aavlroaaaatal coaatraiai 
This eaaa rapraaaata a para traaaport problaa vkara tvo aaaipalatora eooparata ia moving aa objact. Ia t 
aaeoad eaaa. aa environment vat attaaad vkiek raatrietad tka aotioa of tka objert in tha x direction. Ia bo 
eataa, tka daairad aotioa la tka y dlractioa vaa obtaiaad froa a eoaataat accalaratioa trajectory. Tka daair 
aotioa la tka x diraetioa vaa tat aqaal to tka iaitial s value (l.a., x ■ 0) ia tka flrat eaaa. aad tka daair 
Interaction force batvaaa tha objact aad tha environment vaa aat aqaal to zero la tka accoad eaaa. 

Tka simulation of closed kiaaaatic chelae caa altkar ba performed by computing tha iataractioo forcaa baa 
on tha dynaaica aqaatlont (aach aa thoaa developed ia thia paper) or by aaaaaing tha axiatanca of atiff tpriu 
(or apr ing-dashpot) at tha contact points. Since ia actaal experiments, one naaa forca/torqaa aanaora to obta 
thaaa interaction forcaa (torqaaa). tka aaeoad approach ia need in thia atndy. Figure 5.2 illuatratea t 
■odaling of tha contact aechanita batvaan tha nppar links and tha objact and betveen the objact aad t 
anvironaant. Figure 5.3 ahova tha overall aiaulatlon block diagram. 




Figure 5.2 Detailed Modeling of tha connections batvaan (a) tha arms 
and tha object, and (b) tha objact and tha external constraint 


Desired position 
and force _ 



Robot/ 

environment 

dynamics 

(Continuous 

system) 


A 


Desired position 
and force 

p. 


Figure 5.3 Simplified block diagram of the simulation study 

Although this paper does not attempt to address tha digital control aspects of the problem, the simulation stu 
vaa made more realistic by separating the continuous and discrete parts as shovn in Figure 5.3. 
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Figures 5.4 through 5.6 show 


the system for the following set of parameters and control gains 

link lengths 

• .04 (m] 

, equal for all lengths 

link masses 

- 4.0 U|I 

, for lower links 


• 2.0 (E|) 

, for upper links 

object mats 

• 2.0 (tf) 

• 

Initial joint angles: 

» n - 30° . 

* 12 - 120° 


» n • 150° , 

» 22 - -120° 

Position loop gains 

k pp - 4900.0 . 

*p4 ■ 9,0 

Force loop gains 

k fp - 1.0 . 

kfd ‘ 0 

Spring constants 

500,000 [N/a] for all of the springs 


140.00 [N/(b/,.c)1 

Sampling period 

1 asec 



Figure 5.4 shows the tricking capability of the object sioved by the eras. Since the difference between the 
position of the object and the tip of each upper link is insignificant, the plots show the desired va. actual 
position of the upper links. Figure 5.5 shows the tracking of the object in the second case (notion constrained 
in the x direction). Figure 5.6 shows the interaction forces between the object and the environment. The 
tianlation study indicates that the control algoritha developed in this paper yields excellent results. It nest 
be understood that several important practical probleas such as friction, flexibility of the links and joints, 
link parameter errors, etc., were not included in this simulation study. Further research and study is 
necessary to include such effects. 
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Figure 5.4 Pure Position Control with Cooperating Arms 
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PLOT of Load Position vs. TIME 
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Figure 5.5 Position Tricking in Posit lon/Forcc Control with Cooperating Arts 


LOAD-ENVIRONMENT INTERACTION FORCES vs. TIME 
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Figurs 5.6 Force Tracking in Position/Force Control with Cooperating Aras 


CONCLUSIONS 


This paper presented a theory for the position and force control of Multiple nsnipolators holding an object 
which is in contact with an environaent. The derivation is for n nsnipolators each having six degress of 
freedoa. The control is based on the Cartesian foraelation of the ara dynanics and extends the single-erm 
hybrid position/force control concept to the case of multiple aras. Siaple but realistic sianlation studies 
confirmed that the developed control concept results in excellent position tracking and force control. 


202 



Tha (tiiiril dasarlbad i a this 4«tia«it vii p«rforB«i at tkf Jet Propel* ion Likontorf, CaJifariU 
IutitiU of Teekaology, aider contract vltk th National Aeroaaatics nt Spaa* Aiaiiiitntlot. 
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A Survey of Adaptive Control Technology in Robotics 


S. Totunoglu and D. Tesar 
University of Texas at Austin 
Austin, TX 78712 




ABSTRACT 

This paper reviews the previous work on the adaptive control of robotic systesis. 
Although the field is relatively new and does not yet represent a mature discipline, 
considerable attention for the design of sophisticated robot controllers has occurred. In 
this presentation, adaptive control methods are divided into model reference adaptive 
systems and self-tuning regulators with further definition of various approaches given in 
each class. The similarity and distinct features of the designed controllers are delineated 
and tabulated to enhance comparative review. 


1 . INTRODUCTION 

Control of robotic manipulators is a challenging problem mainly due to tne nonlinear 
and coupled nature of the system dynamics. A considerable amount of valuable work has been 
produced in the dynamic formulation and the control of these systems within the last two 
decades. Since the pioneering works of Uicker { 1 ) , Hooker and Margulies {21, and Kahn and 
Roth [31 on the formulation of dynamics, researchers have concentrated on the efficient 
computer implementation and numerical construction of the dynamic equations, while the work 
on the efficient dynamic equation algorithms is still going on, control of manipulators has 
also received significant attention. Over the years, literature on the manipulator control 
methods using optimization, linearization, nonlinearity compensation, and recently, adaptive 
technique* has become quite rich. 

This paper reviews the accumulated work in the area of adaptive control as applied to 
robotics. The reader should note that adaptive control in itself is not yet a mature 
discipline in systems .heory. Also, since some of the existing tools in adaptive control 
are strictly for linear and/or time- invar lant systems, their application to robotics 
deserves special attention. The immaturity of adaptive control is best demonstrated by the 
lack of a definition of adaptive control agreed to by the leading researchers [4]. 

According t. Webster's dictionary, to adapt means "to adjust (oneself) to new 
circumstances". Adaptive control, then, in essence, is used to mean a sophisticated, 
flexible control system relative to the conventional fixed feedback system. An adaptive 
system will assure quality system performance when large and unpredictable variations m the 
plant dynamics or loading occur. Although our aim is by no meant to establish the missing 
definition, since the robotics community seems to have reached a consensus on what is meant 
by adaptive control, we will give our definition to illustrate our interpretation of 
adaptive control. 

Definition : A feedback control system is adaptive if the gains are 

selected with the on-line information of plant outputs 
and/or plant state variables. 

This definition is depicted in block diagram format in Figure 1. The above definition 
encompasses all the previous work on the adaptive control of manipulators currently 
available to us. 
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Although the urly work on adaptive control dates back to tho 1990a, its first 
extensive application to robotics was given by Du bows ky and DesForges In 1979 (9). tines 
than a variety of dlffarant as t hods has basn developed. So far, tha existing adapt iva 
control methods applied to robotics say ba categorised under tha design of 

1. Nodal Reference Adaptive Systeas (MAAS) 

ii. Self-tuning Regulators (STR) 

The following sethods are used in the design of MAASt 

i. Local paraaetric optimisation 

ii. Lyapunov’s second sethod 

ill. Hyperstability theory 

iv. Sliding control theory 

The STR design procedure say be divided into three steps s 

i. Selection of a parametric structure to represent the robotic system via 
discste-time modeling 

ii. On-line estimation of system parameters using the least squares, extended least 
squares or maximum likelihood methods 

iii. On-line controller design based on the estimated system parameters via extended 
minimum variance or pole-sera placement techniques 

Block diagrams of MRAS and STR are illustrated in Figures 2 and 3. Note that the 
dotted boxes in these figures may be reduced to the regulator block in Figure 1. After s 
brief review of system dynamics, the related background work is presented below. 



Fiqvz a 2. Block Oiagraa of *odal ftafaranca Adaptive Syatea ftqur* 3. Block Olagr— of Self-tunine Regulator 
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2 


SYSTEM DYNAMICS 


Dynamic equations of an n-link, n -d eg ree-of- freedom, spatial, sarial robot an with 
rigid links ars given by 


At#) I - r ( I# i) • - G ( I) I - u 


(1) 


whara #cR n is tha relative joint displacement vactor {* • (•* i*J*cR 2n )» A(t)ck n,tn - ia tha 

generalised inartia Matrix, -f - -P(ft,i)icR n raprasants tha inartia torquas dua to 

can trifocal and Coriollis accelerations, -g - -C(4)4cR n is tha gravity loads as saan at tha 

joints whara G(t)«I ,Mtn is non-uniqve, and u f R n is tha control. In state-space 
representation, Cq. (1) can ba given by 


A~ l G A -1 F 


i ♦ 


(21 


Mot a that functional dapandancias ara droppad for clarity. If aach actuator (O.C. Motor) is 
aodalad as a sacond-ordar , linear, tiMa- invariant subsyatam (neglecting tha anatura 
inductanca) , and is couplad with tha Manipulator dynasties, tha praviously dafinad stata 
vactor, x, will ba preserved and tha control will ba tha actuator input voltage. In this 
casa, Eq. (2) takas tha following fon 



0 

X 


1 

O 

X - 

A * 1 (G-Ej ) 

A _l <r-E 2 ) 

X ♦ 

1 

1 

A X Ej | 



1 

L. —1 


whara a • A ♦ J is tha cowbinad inartia matrix with J • diagfJ ], J ic tha rotor inartia of 

k k 

tha k th actuator rafarrad to output shaft, E ^ , Ej and Ej ara diagonal, positiva dafinita 
constant matrices and functions of various actuator/gaar train paraaatars. 

Although most of tha works do not includa tha actuator dynamics, the abova, 
simplified form may ba substitutad, sinca tha form of tha aquations remains tha cam . 
Depending on tha adaptation algorithsi, thasa constant actuator paraaatars may aithar ba 
includad in tha on-line idantif ication schama or assuaad known. In our prasantation, tha 
ganaric u will raprasant tha suitabla control (aithar tha affactiva input torquas or tha 
voltages j . Tha only axcaption is (6] whara third-order actuator dynaaics is studiad in 
addition to tha abova siaplifiad form. Tha dynamic equations, Eq. (2) or (3), may ba givan 
in tarns of tha robot-hand coordinatas axprassad in a fixad rafaranca frama (task-oriented 
coordinates ) and adaptive controllers may be designed for this systesi (6,?, 8]. 


3. KRAS-BASED CONTROLLERS 

In KRAS design, usually a sacond-ordar , linear, time-invariant, continuous- time 
rafaranca modal is selected for aach link of tha sarial robot. Than, a control law is 
derived to force tha robot to behave like tha selected modal. As mentioned earlier, local 
parametric optimization (5,9], Lyapunov's second method [10], hyperstability (11,12,13], or 
tha sliding control theory [14,15,16] is usually employed to achieve tha goal. 

In 1979, Dubowsky and DesForges [S] implemented tha local parametric optimization 
technique on a robot arm. In their formulation, aach servomechanism is modeled as a 
sacond-ordar , single-input, single-output system, neglecting the coupling between system 
degrees of freedom. Than for aach degree-of- freedom, position and velocity feedback gains 
ara calculated by an algorithm which minimizes a positiva sami-daf inita error function 
utilizing tha steepest descant method. Stability is investigated for tha uncoupled, 
linearized system model. This work represents the first implementation of adaptive control 
to robotics. 
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The recent works hsvs concentrated on the designs based on ths Lyapunov's second method 
and the hyperstability theory. In the most general case, thasa control methods yiald tha 
following control structura u * 


u p ‘ Vl'WVV * Vi'WW * 4 3 f 3 J '*p ' x r ' *r* 


where subscripts p and r raprasant tha plant and rafaranca model, respectively, 5 ^ is aithar 
0 or 1 ( K^cP nxn nonlinaar or constant gain matrix, 1*1,2, or 3, a^cR* raprasants an 

unknown systaa paraaatar Ilka tha payload, link mass contant, cantar of gravity location, 
ate., vhara a coablnation of thasa constant paraaatars or a nonlinaar tara is to ba 
astiaatad in tha adaptation proeass, and j»l,2,...,k, vhara k dapands on tha spaclfic 
controller design. Although some controllers call for plant joint accelerations, they are 
not shown in Cq. 14) . 

Tha first tans f, in Cq. (4) describes tha nonlinearity coapensation. It may 

represent tha complete manipulator dynamics as in (17,16), or only tha gravity terms and tha 
Jacobian as in (7). a controller with « 1 and 5^ • * 0 indicates only a nonlinearity 

compensation. Tha second term in u p represents the feedback portion of tha controller. Tha 
gain matrices may either be nonlinaar or constant. No v • 0 and 5^ • 1 represent 

tha control structures of (16,19,20,21) among others. Tha third term in u p includes tha 
portion of tha control where systaa parameters are estimated (17,18,22). Slotina (18), for 
example, includes all tha components Into his controller, therefore, 6^ * Jj • Jj - 1. 
Takegaki and Arimoto's control strategy (7) may ba susttarized by ^ • 1 and - 0, 

Horowitz and Toaizuka’s (22) * 0, * 1, etc. 

Various MRA5 control structures are summarized in Table 1. This table differentiates 
tha methods which require explicit calculation of dynamic equations (Nonlinearity 
Compensation) from tha methods which adaptively estimate tha plant parameters on-line 
(Incorporation of Plant Parameter estimation). However, further distinction is needed in 
tha latter group, since while one approach explicitly identifies the nonlinear terms (as ia 
A, G, and r with reference to Cq. (2)), and estimates them on-line, tha recant methods treat 
the constant robot parameters as unavailable, estimate and compensate them in their 
algorithms. Some methods choose nonlinear feedback matrices in their controllers (H, L, S 
in Table l) without incorporating the explicit system parameter* estimations . 


The early works presented in Table 1 have generally avoided the nonlinearity 
compensation and opted for the assumption that the nonlinear system parameters vary slowly 
in time. On this basis, a stability analysis is given for the system. This assumption 
almost certainly is too restrictive, since the nonlinear manipulator system parameters are 
functions of the jcint position and velocities. The faster the robot movement is, the more 
rapidly the system parameters will vary. The objective on the other hand, for the more 
sophisticated control methods is to enable fast robot movements with high precision. As a 
result of revolutionary advances in the microprocessor industry with prices steadily coming 
down, the possibility of real-time implementation of computation intensive algorithms is 
steadily improving. Recently, Wander and Tesar (23,241 have implemented the complete 
dynamic equations [251 of a 6-link, general architecture robot arm in 6.5 milli sec. (150 
Hz) in explicit form without using recursion. Their algorithm is able to treat an 
n-degree-of-freedom (DOF) serial system of completely general parameters (43 ailli sec. for 
l 2 -DOF) . They have implemented the algorithm on an Analogic AP-500 array processor. 

Further work on the comparative analysis of various computer architectures is underway at 
the University of Texas at Austin. 


Some of the most recent works include nonlinearity compensation along with a feedback 
portion and parameter identification features (6,17,181. Once the control has the form » 

. ? 

A (9 )u* where subscript p denotes the plant, A_ is the on-line calculated generalized 
P P P P 

inertia matrix and is yet to be selected, generally, global stability of the closed-loop 
system can fce shown provided that where I is the identity matrix of order n, is 

maintained (6). Otherwise, in reference to Table 1, all methods without nonlinearity 
compensation need to assume system parameters stay constant during the adaptation. 
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Table 1 . Summary of MRAS Controllers in the Literature 



1 : Calculates c omplete or partial nonlinaar dynamics on-line. 

; : Robot link lengths, mast corrtanta, actuator parameters, ate., il not otherwise specified. 

3 : If "yes", stability anaiyaia baaad on this aaaumption. 

G : Gravity load companaatad; alao raquiraa on-lina Jacobian calculation. 

A : Raquiraa only tha on-iina calculation of tha inartia matrix. 

0 : Nonlinear-gain taadback ualng local paramatrie optimisation. 

C : Conatant-gain faadback. 

L : Nonlinaar-gain faadback ualng Lyapunov'a second method. 

H : Nonlinaar-gain faadback uaing hyrerstablllty thaory. 

S : Nonlinaar-gain faadback uaing adding thaory. 

N : Structure of nonlinaar syatam paramatara (funetlona of state variablaa) are explicitly assumed 
known and are adaptively estimated; stability analysis based on hyperstability thaory. 
v : Yea. 

- : No. 
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Bales trino, ft al. (19] have developed an adaptive control lar which produces 
discontinuous control signals leading to chattering. Stability analysis is presented using 
hyperstability theory. In (16] , Balestrino, et al. present three methods; the first is 
based on the theory of variable-structure systems, the second on the hyperstability and the 
third is a combination of the first two methods. Designed controllers produce 
high-frequency chatter which is highly undesirable since higher order dynamic modes may be 
excited. Numerical simulations show an extremely high frequency of sign switches in the 
plant input, prohibiting its physical realization. Stoten [21] formulates the MRAS problem 
closely following the procedures in (9) and simulate the algorithm for a 1-link manipulator. 

Horowitz and Tomizuka [22] study the adaptive control of a 3-link arm. Gravity effects 
and the mass and inertia of the first link are neglected. Each nonlinear term in the 
dynamic equations is identified a priori, treated as unknown, and estimated by an adaptation 
algorithm. For the modeled system and the designed controller, stability analysis is given 
by Popov's hyperstability theory. Later, Anex and Hubbard [26] have experimentally 
implemented this algorithm with some modifications. System response to high speed movements 
is not tested, but practical problems encountered during the implementation are addressed in 
detail. 

Takegaki and Arimoto [7] propose an adaptive method to track desired trajectories which 
are described in the task-oriented coordinates. The suggested controller compensates 
gravity terms, calculates the Jacobian and the variable gains, but does not compensate the 
manipulator dynamics completely. System stability is assured if the manipulator hand 
velocity is sufficiently slow, i.e., nonlinear system parameters change slowly. 

Nicosia and Tomei [27] derive control laws using the hyperstability theory to follow a 
linear, time-invariant reference model. The plant (manipulator) parameters and the payload 
are assumed known and are not identified. Their controller does not produce chattering and 
Is relatively easy to implement. Lim and Eslami (20] propose controller designs based on 
Lyapunov's second method. The author's objective is to control the linearized dynamic 
equations with the developed controllers; hence, assuring the stability of the linearized 
system. Later, nonlinearity compensation is suggested to enhance the system response. 

Whyte [281 designs an adaptive controller via Lyapunov's second method. The algorithm does 
not require any knowledge of the manipulator dynamics and selects nonlinear gains in the 
feedback loop to follow the reference model. System stability is shown, provided that the 
parameter changes are slow. Hsia [29] reviews the current methods used in adaptive control 
and gives brief formulations for each method. Vukobratovic , et al. [30], review local 
parametric optimization and hyperstability-based methods and choose not to include the 
approaches based on Lyapunov's second method in their book on the non-adaptive and adaptive 
control of manipulators. 

Tosunoglu and Tesar [6] select a generalized nonlinear reference model which represents 
ideal robot dynamics. The plant, the actual robot whose system parameters may not be 
exactly known, is then forced to behave like the reference model to follow the desired 
tra}ectory. The advantage of the nonlinear reference model is that the adaptation process 
ceases once the nominal trajectory is recovered. (Such is not the case when linear models 
are selected.) Error-driven dynamics is derived and the system is augmented to include the 
integral feedback feature to eliminate the parameter discrepancies between the plant and the 
reference model, and the disturbances acting on the system. It is shown that the 
controllers designed in this work via Lyapunov's second method also produce hyperstable 
systems. Simulations demonstrate successful trajectory tracking on 3- and 6-link, spatial 
manipulators under unknown payloads and estimated system parameters (link lengths, masses, 
inertia components, payload, etc.). The authors also provide comparative analyses of the 
effect of integral feedback and various controller update rates, 60 to 200 Hz. 

Craig, Hsu and Sastry [17] take an interesting approach in designing an adaptive 
controller using the Lyapunov's second method. In this work, the structure of the terms in 
the dynamic equations is assumed known, but their numerical values remain unknown. They 
partition the dynamics into known and unknown portions and estimate the unknown parameters 
along with compensation for the nonlinearities. Global stability is proved by assuming that 
a matrix function of the plant joint position, velocity and accelerations is bounded. 
Although all the terms which are functions of positions are bounded, velocity and 
accelerations may increase without bounds; thus, making the matrix unbounded. However, 
modif ications in the controller structure may alleviate this problem. Their numerical 
simulations identify link masses and Coulomb friction coefficients for a two-link 
manipulator with encouraging results. 
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8 lot ins and Li [18] derive a control law with full faadforward dynamics compensation, 

PD feedback and on-line payload and manipulator parameter estimation using Lyapunov's second 
method. Since this control scheme does not eliminate the steady-state errors, the authors 
restrict the steady-state position errors to lie on a sliding surface. This modification, 
in turn, causes the loss of numerical efficiency where. Interestingly, the authors make use 
of the recursive computation feature of the manipulator dynamics. Later, an approximate 
implementation is suggested to improve the numerical efficiency. Payload parameter 
Identification is simulated on a two-link manipulator. 

Once these current methods are refined, application to manipulators with higher degrees 
of freedom will naturally follow. Determination of the structure of the constant terms (for 
identification) for manipulators with higher number links may be achieved with symbolic 
generation of dynamic equations, but the effect of increased number of terms will require 
further investigation. 


4. SELF -TUNING REGULATOR (STR) BASED CONTROLLERS 

In this method, typically, nonlinear manipulator dynamics is linearized about a nominal 
trajectory and then discretized. The discretized model gives the structure of the 
parametric model whose parameters need to be estimated on-line. The parametric model is 
given by the following multivariable difference equation 

y (k) - 0 T 4 (k-1) + e(k> (5) 


where y(k)cR n is the system output, k is the sampling time counter, QcR nx * 2nm+1 * contains 
the parameters to be identified, ♦cR (2n ®* 1 * represents the combined system input and output 

vector, ecR n is a random, zero-mean Gaussian white noise, and m is the order of the 
estimation model. 

Parameter estimation is based on the system identification techniques using the sampled 
input-output data. Although such techniques include the least squares, extended least 
squares and maximum likelihood methods; the recursive least squares method is extensively 
used because of its lower computational requirements [8,25,29-36]. The recursive least 
squares estimation is given by 


e.(k) « 8 i (k-1) + P(k)4(k-1) (y i (k) - 0*(k-l) ♦ (k-1) ] (6) 

where 

poo - 4-t poc-h - , 

A x ♦ ♦ik-l) P (k-1) * (k-1) 

e i (k) represents the estimate of the i^ row of 9 defined in Eq. (5), P(k) is a square 
symmetric matrix of order (2nm+l), and 0 < A < 1 is an exponential forgetting factor. 

Once the parameters are identified at each sampling time, regulator parameters are 
estimated using the extended minimum variance [8,30,32], or pole-zero placement techniques 
[29,33,35]. The method described above is known as explicit or indirect STR. If the 
regulator parameters are estimated directly by a reparameterization of the process model, 
the model is called implicit or direct STR. Usually implicit STR algorithms cancel all 
process zeros making them suitable only for minimum phase systems. 

Koivo and Guo [32] assume an autoregressive model and identify system parameters using 
the recursive least squares technique. They design an extended minimum variance controller 
for the estimated model. The method chooses a quadratic performance index in terms of the 
state error vector and the system control vector and minimizes it relative to admissible 
controls while satisfying Eq. (5) . Their simulations Include decoupled and partially 
coupled parametric model structures. They report that the parameter convergence is faster 
in the decoupled case, and that no significant Improvement in the system response is 
observed for the coupled model. This is rather interesting, because the amount of on-line 
calculations is considerably reduced for the decoupled case. Also reported is the fact that 
the model and the controller parameter identifications may not converge fast enough while 
the robot motion takes place. 
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Lee [8] derives the perturbation equations, discretizes then and estinates the systen 
parameters using the recursive least squares method. Then an adaptive controller is 
designed using the extended minimum variance technique. The parameter identification 
requires the estimation of 6n J parameters on-line (216 for a 6-link manipulator). Lee 
provides a detailed breakdown of the computational requirements and concludes that for a 
6-link manipulator the control scheme can be updated at about 56 Hz on a PDP-11/45. 

Hsia [29] reviews the STR formulation on a decoupled model. Kamik and Sinha (35 J 
develop an STR based on a non-minimum phase model which assigns the system poles while 
retaining all the zeros. This algorithm is developed for a UNIMATE-2000 robot. Landau 
[9,36] and Vukobratovic, et al. (30] review various STR designs in detail. 

In general, discrete-time formulation is especially suitable for computer control. 
However, on-line estimation of all system parameters and the control design make STR 
computationally intensive. Astrom [4] reports that numerical estimation techniques tend to 
be numerically unstable as the number of parameters increases in the system model. In this 
case, the complete system is parameterized. However, the papers reviewed in this work do 
not raise the question with regard to numerical instability although they do indicate the 
importance of initial parameter selections. In STR methods, convergence of estimated 
parameters in the adaptation process is guaranteed if the system parameters are constant. 
Since the actual robot model parameters are nonlinear functions of the state vector, the 
question of system parameters varying slowly in time again arises in the STR methods. 


5. CONCLUSIONS 

Adaptive control of robotic systems has received significant attention within the past 
few years. A class of control laws based on the MRAS design realize the adaptation through 
signal synthesis with a completely known parameter structure, while some methods select a 
subclass of the parameters for identification for reduced computational burden. All 
adaptive controllers via STR design and some MRAS-based methods estimate the complete 
(nonlinear) system parameters on-line. 

Stability analysis usually relies on the condition that the nonlinear system parameters 
vary slowly. This condition is removed if a nonlinearity compensation component is also 
incorporated i the controller. The most recent works, which exploit the special structure 
of manipulator dynamics, seem to favor this feature. The use of state-of-the-art 
microprocessor technology along with the sophisticated dynamic formulation algorithms 
strongly indicate that real-time implementation of dynamic compensation is rapidly becoming 
feasible. 

Further research to perfect the existing algorithms and to provide rigorous stability 
proofs, which will improve the maturity of the adaptive control, is still needed. Although 
today's industrial robots employ linear controllers to accomplish a number of useful tasks, 
fast and precise robot movements remain to be implemented. Development of laboratory test 
beds and implementation of the developed adaptive controllers on robotic manipulators are 
also crucial, since only after successful demonstrations will technology transfer be 
possible. 
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Simple Robust Control Laws for Robot Manipulators, 

Part I: Non-adaptive Case 

J.T. Wen and D.S. Bayard -r h ' '■ ' ^ 
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California Institute of Technology 
Pasadena, CA 91109 


A new class of exponent lal ly stabilizing control laws for joint level control of robot arms Is Introduced. 

It has been recently recognized that the nonlinear dynamics associated with robotic manipulators have certain 
inherent passivity properties. More specifically, the derivation of the robotic dynamic equations from the 
Hamilton's principle gives rise to natural Lyapunov functions for control design based on total energy 
considerations. Through a slight modification of the energy Lyapunov function and the use of a convenient 
lemma to handle third order terms in the Lyapunov function derivatives, closed loop exponential stability for 
both the set point and tracking control problem Is demonstrated. The exponential convergence property also 
leads to robustness with respect to frictions, bounded modeling errors and Instrument noise. In one new design, 
the nonlinear terms are decoupled from real-time measurements which completely removes the requirement for 
on-line computation of nonlinear terras in the controller Implementation. In general, the new class of control 
laws offers alternatives to the more conventional computed torque method, providing trade offs between 
robustness, computation and convergence properties. Furthermore, these control laws have the unique feature that 
they can be adapted In a very simple fashion to achieve asymptotically stable adaptive control. 


1. Introduction 


The problem of Joint level control for the multi-link rigid articulated robot arm Is addressed In this paper. 
Accurate measurements of the Joint variables, either angular or displacement, and the Joint velocities are 
assumed available. Traditionally, this problem has been treated by the PID algorithm. Since the justification 
of using PID control is based on either linearization or some local stability argument (1], Its application 
is limited to small angle maneuvers. Large excursions usually require partitioning a desired trajectory Into 
Intermediate points and PID control Is used to drive the arm between adjacent points. This approach la less than 
satisfactory since global stability and adequate performance are not guaranteed. This then motivates the 
computed torque method [21 which compensates for nonlinear terms In the robot dynamics. Assuming that the robot 
dynamics are known exactly, the compensated system apoeara like a decoupled system of double integrators and 
the closed Loop dynamics can be shaped Into desirable forms. 

A different approach has been advanced in the past few years. It Is based on the recognition chat robot arms 
belong to the class of natural systems, which means time Invariant, unconstrained and lying in a conservative 
force field (3]. It is natural to investigate If the structure specific to this class of systems can be 
exploited in controller design. It has long been known [3,4 and earlier] that negative proportional (generalized 
position) and derivative (generalized velocity), or equivalently PD, feedback globally asymptotically stabilized 
natural systems. The stability analysis is based on a Lypaunov function motivated by total energy considerations. 
Application of this result to robot arms has been relatively recent. In particular, global asymptotic stability 
under Joint level PD feedback with gravity compensation has been shown [5-8]. Application to the tracking problem 
is more difficult due to the time varying nature of the problem. More specifically, the stability analysis 
requires a generalization of the Invariance principle to time-varying systems; this Issue has been partially 
addressed in [9-10]. 

In this paper, we will introduce a new class of exponentially stabilizing control laws for both the set point 
and tracking control problems. The stability proof is achieved by making use of a particular class of energy-like 
Lyapunov functions in conjunction wLth a useful lemma for addressing the higher order terms in the Lyapunov 
function derivatives. In the set point control case, Lyapunov functions based on various artificial potential 
fields are used to derive control laws possessing desired properties. These include set point controllers having 
simple PD or PD+bias structures and the ability to handle Joint stop constraints. In the tracking control case, 
this new class of Lyapunov functions avoids the need for a generalized invariance principle, which, as mentioned 
above, has been the major source of difficulty In existing approaches. This leads to a new class of exponentially 
stabilizing tracking control laws. In one design among this^new class, the nonlinear terras are decoupled from 
real-time measurements which completely removes the requirement for on-line computation on nonlinear terras In 
the controller implementation. This result is believed to have no counterpart in the present day literature. In 
general, the new class of control laws offers alternatives to the more conventional computed torque method, 
providing tradeoffs between robustness, computation and convergence properties. Furthermore, these control laws 
have the unique property that they can be adapted In a very simple fashion to achieve asymptotically stable 
adaptive control. This last property will be elaborated on In the companion paper [13]. The closed loop 
exponential stability also allows the robustness property to be established with respect to viscous and Coulomb 
friction, bounded modeling error and instrument noise. 
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This paper is organized as follows: Some background derivations, identities, notations, lemmas and relevs 

results In the literature are covered in Sec. 2. Several useful set point controllers based on different arti- 
ficial potential energies are presented in Sec. 3. A new Lyapunov function is also Introduced to demonstrate 
exponential convergence. In Sec. 4, a new family of exponentially stabilizing tracking control laws are derive 
We will discuss the trade off between the ease of implementation and the strength of assumptions for these 
controllers. Their robustness properties are also analyzed in this section. Finally, conclusions are draws 
in Sec. 5 together with a table summarizing all of the controllers presented in this paper and the conditions 
for stability. Due to the size limitation of this paper, the detail derivations are given in • 

2 . Background 

2.1 Robot Dynamic Equation 

In this section, the dynamic equation of robot manipulator is derived. At the first glance, it appears 
as a complex, tightly coupled set of nonlinear equations. However, based on derivation from Hamiltonian 
principle, the nonlinearity actually contains a great deal of structure. As a result, soma important 
Identities are developed in the next section on which the rest of the paper is based. 

An n-link rigid robot arm belongs to the class of so-called natural systems with the kinetic and 
potential energies given by 

T - J 4 2 T M< qi )q z 

- ( 2 . 1 ) 

U - - q 2 u + g(q x ) 

where 

T • kinetic energy , U * potential energy , q^ • Joint angle or position vector e R n , 

q 2 * Joint velocity vector c R n , M(q^) • mass inertia matrix e R nxn , gCq^) * gravitational potential energy , 
u * joint torque force vector e R n 

Note that since all the analysis is done at joint level, the arm can be redundant (more than 6 joints). 

To derive the differential form of the robot dynamics, first set up the Lagranglan 


Then apply the Lagrange's Equation 

!_ (AL.) . . o 

dt k 3q/ 3q 2 


This gives the dynamic equation of robot motion: 


M(qj)q 2 * - C( qi ,q 2 )q 2 - k(q 2 ) + u 


CC^.qj) - I ((e t q 2 T M 1 <q 1 )) T - y (ej q^M^q^)] 


■ ith unit vector 


3g(q,) 



1 3q u 

* ith component of 

The term C(q 1 ,q 2 )q 1 represents the coriolis and centrifugal forces and k. (q ^ ) represents the gravity load. 
Note that C(q^, q 2 T is determined entirely from the mass-inertia matrix. Many desirable properties, for 
example, inherent passivity, well-posedness of solution (no finite escape under any bounded control), 
existence of solution to optimal control problem [14) etc. are the consequence of this additional structure. 
Other important properties of (2.2) include that M(q^) and M^(q^) are symmetric and M(q|) is positive 
definite, for all cR n . For later use, the matrices CCq^^) and M^(q^) are interpreted as R nxn valued 
function of two n-vectors (q^ and q 2 ) and one n-vector (q^) , respectively. 

2.2 Some Useful Identities 


Some key identities that will be used throughout this paper and the companion paper [13] are derived 
in this section. First define some notations: 
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(2.4) 


n _ 

• J M^q^z • * 

M ( i 1 .q 2 ) • If M(q l ) 

J(q lt *) - [( Bl z T M 1 (q 1 » - (e t z T 

r ^ q l’ q 2 ,q 2d^ " ^2 _< '2d^ T ^7 ^ q l ,q 2^ q 2” q 2d^ ” **^ q l ,q 2^ q 2^ 


(2.5) 

(2.b) 

(2.7) 


^ld’ q 2d " d##lr€<1 Joint position and joint velocities 

4q i * "1 • q id 
aq 2 ■ q 2 - q 2d 

Again, ^ and J are regarded as R nxn valued function of two n-vector arguments. 
T 

matrix, i.e., J+J • 0. 

Identity 1 

M(q lt q 2 )* * M D (q 1 .*)q 2 

Identity 2 

C(q 1 ,z)2 - \ (^(q^z) - J(q l .*))* 

Identity 3 

J( qi ,z) - M ]) T (q 1 .*) - ^(qx-*) 

Identity 4 

M D T (q 1 .*)y - «D T <q x- y) * 

Identity 5 

r(q 1> q 2 ,q 2( j) “ | &q 2 T (-7(qx' q 2 )q 2 d “ M D <q l ,q 2d* q 2* 

* i iq 2 T (J(q 1 .q 2d )q 2 _ tl o { qx’ q 2 )q 2d ) 

- i Aq 2 T (j(q 1 .q 2d )q 2d - VV'^'W 

. i aq 2 T (J(q r q 2 )q 2 - 

Identity 6 

H D (q 1 .Aq 2 )q 2 - - 7 * q 2 d ) q 2 ) 


( 2 . 8 ) 

Note that J is a skew symmetric 


(2.9) 

( 2 . 10 ) 

( 2 . 11 ) 

( 2 . 12 ) 

(2.13) 

(2.14) 

(2.15) 

(2.16) 


- « D (q 1 -“q 2 ) q2 “ c(q r q 2 )q 2 "I (J(q r q 2d )q 2 ' VvV'W 

■ 7 (M D T (q 1 ^q 2 ) Aq 2 + M D T(q l’ q 2 d )4q 2 ‘ M D (q X ,q 2d )Aq 2 + M D {q X ,4q 2 )q 2d ) <2 ' 1?) 

M D (q 1 ,Aq 2 )q 2 - C(q L ,q 2 )q 2 - 7 (J(q i ,q 2d )q 2d " **0^1 • q 2 )q 2d ) 

- (M D T (q 1 .q 2d ) - H D (q 1 tq 2d ))A< ? 2 + 7 + 7 V q l ' ^ q 2 )q 2d (2 * i8) 

M 0 (q l*‘ iq 2 )q 2 * C ^ q l ,q 2^ q 2 ’ 7 (J(q l ,q 2 )q 2 " ^ q V q 2d^ q 2^ 

*7 ^ M D^ q l ,iq 2 ^* iq 2 + M D^ q l ,iq 2 ^ q 2 d^ 

2.3 Important Lemmas • ! 


tn this section , two important stability lemmas are presented that will play pivotal roles in later 
sections. The first lemma is essentially a local stability theorem that establishes a region of convergence. 
The second lemma generalizes the first result to the Lagrange stability case. The proofs of these lemmas are 
given in [23] . 
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Lemma 2-1 


Given A dynamical ay* tern 

*i " ^ 1 (* 1 »*“* X N » O * *i c * ni » t 10 

uc fi*# be locally Lipschitx with respect to x lt ...,x N uniformly in t on bounded Interval* end continuoue 
in t for t >. 0. 

n« x • • • x iL. 

Suppose e function V: ft x is given such that 

N T 

v <*i V ° • it \ ml x i V x i x r t)x j • 

V is positive definite in x^»...,x^ 


V(x 1 Xjj, t) <- l (o t - l Y 1;J I Ixj (t)l! ki ^llx 1 (t)|| 5 


( 2 . 20 ) 


where a ±f Y^t k^ > 0, C. I x C{1,...,N) 


Let > 0 be such 


€ t l l x t l I iV(x x V t) 

L«c V e $ V(x x (0) x M (0), 0) 


If V 1 c I x . 


V lil 

2 


then V 


v k n 

o.— +- 


*1 » I i-f-) 

JeI *i J _ 

x i c (0 * °i ' Y u ^ 2 

v (x. V tli'I x 

1 " iclj^ 

In the above lemma, we choose to bound over £ Y t1 ||x (t)||**J 

Jd,, 


V t > 0 


(2.21) 


( 2 . 22 ) 


(2.23) 


(Condition (2.20) reflects that choice). This choice is arbitrary; in fact, we can extract any quadratic 
cross term from 


l y.. llx.(t)|| k U |lx.<t)ll 2 
Jcl 2l J 

and overbound the rest. After completing square stability condition similar to (2.22) can be stated. 
We do not pursue this generalization here. 

Lemma 2-2 


Suppose In Lemma 2-1, 


V(x. X ,t) i - I 

1 N Id* 1 


I x - (1 N) 


[ Y lt l|x,l| k U) llx l| 2 + 0 
Jd 2l « ^ 1 


Let p - sup | |P(x x ,t) | | < 

x i£ :R n i t tcR+ 1 N 


[P] 


iJ 


1J 


(2.24) 


If V i e I 


1 

op 


vt + j YlJ (r) 


v Y 

o x 2 




J 


Chen 


limsup V (x <t) 3^(t),t) i. j 


where X - - min (a 
? id, 1 


3«I, 


'1J 'K 


v ^ 

(T 2 ) 2 ) 


1 2 i 

Furthermore, the convergence to the set { (x^ , . . . ,x^) : ||: 


V -U 

k 


(2.25) 

(2.26) 

(2.27) 

ktl^) is exponential with rate X. 
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The utility of this result Is mainly In robustness analysis. Basically, given a bounded aat of 
poselble initial conditions, axe lading a neighborhood of tha origin, at' a aunt ba larga anough in tha 
aanaa of (2.23) for tha trajectories to ba uniformly bounded. Vhat if V 0 • 0? Wa can ahlft t - 0 to 
toaa finite tine latar when Vt + 0. in practice, a neighborhood around origin can uaually ba excluded 
a Inca some robuet locally stable control algorithm such aa P1D takes over. 

2.4 Recant Results 

So*a of the recent results related to Lyapunov analysis of robot systems are reviewed in this section. 
For the set point control problem, [3-10] has applied the result that linear negative feedback of generalised 
position and velocities globally asymptotically stabilises a natural system to robot manipulators. Va will 
restate this result, mention work for the tracking problem in (9. 10.18) and state some open Issues that 
will be addressed in the remainder of this paper and in [13]. 

Theorem 2-1 

Consider (2.2) with tha control law 

u - - K p - K y q 2 + kCq^ , K p > 0. > 0 (2.28) 

The null state of (Aq^ q 2 > - system is a globally asymptotically stable equilibrium. 

The main idea of this approach is to shape the potential field in such a way that it la globally convex 
and attains a global minimum at Aq^ • 0. Complete damping (in the terminology of [3]) is added through the 
derivative feedback to drive the system to the minimum potential energy state which by design is the 
desired state. To be specific, suppose the desired potential field is U*(Aq^). The total energy under 
this potential 1$ 


Rewrite V as 

V ■ T + U° + U* - U° • V° + U* - U° 

where U° is the original potential energy without external foTce fields, and V° la the corresponding 
total energy. Let p - H(qx)q2 be the generalised momentum. From Hamilton* a equation, 

• ,3V°T T/ 3U* 3U° ' 

V ’ < 3T ) U+q 2 ( 3^" 

T, . 3U* 3U° . (131 

“ <2 <U + TZT X ' ( 

Hence, co drive Che desired cocal energy Co ice nlnimua scece, ve can select ((51) 

v 3U* ^ 3U° (2.3 

u ■ - K q- - - . + ’TJ— 

v^2 3Aq, <>Aq. 


Then V - - q 


T K q v From the fact that -2q T K q Is uniformly bounded (V <_ 0) , q (t) ■* 0 as C - • [19]. 


3T _ 3U 
~ 3Aq 1 


3T iU^L 

3q x ‘ v q 2 ?Aq L 


Slnce 11 o - U — - 0, also. Hence, if U*(Aqi) is globally convex with minimum at Aq. » 0, Aq T (t) - 0 

’ q l 3Aq i * - ‘ 

as t -* •• If U*(Aq 1 ) ' y Aq x Wq^ Theorem 2-1 is Immediately obtained. 

Obviously, any other potential field convex In Aq* that has global minimum at Aqi - 0 (and no local 
minima) can be used. We will use this Idea in the next section to address the joint stop Issue. 


This control law Is very appealing in its simplicity and obvious robustness with respect to 
modeling error In mass matrix, and centrifugal and corlolla terma. Generalization to the tracking 
problem is partially addressed in [9, 10]. A control algorithm is given in [9], but it lacks stability 
analysis. In [10], Matrosov's Theorem [11] is used. A question remains on ths necessity and 
applicability of the Matrosov' a Theorem to the tracking problem. One version of the tracking control law 
In Section 4 is the same as in [10]. but the stability issue is resolved more completely. Nonadaptive 
version of the tracking control laws in [10, 18] do yield global asymptotic stability. However, the 
simple structure of (2.28) is lost; even for set point control, full model information is reeded. 
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Based on the above very brief review of the currently available pertinent results. It Is evident that the 
following lsauee remain open: 

1. Can we get away with no gravity information, thus achieving a "universal” (arm Independent) set 
point control law? 

2. Computed torque achieves exponential stability. Are schemes based on energy Lyapunov functions 
Inherently inferior (e.g., only asymptotic stability la possible) or have we not been clever enough 
In choosing the Lyapunov functions? 

3. The tracking problem produces a time -varying system. Can the Invariance Principle still be applied? 

4. How far can we reduce the on-line computation requirement (thus allow Increasing performance) for 
both set point and tracking problems? What la the price to be paid? 

3. Can we ensure any robustness properties with respect to friction, Instrument noise, modeling errors? 

6. How does one incorporate Joint stop constraints? 

7. Would these schemes (set point and tracking controls) still work If unknown parameters are adapted? 

The rest of this paper will be devoted to answering issues 1-6. The last item Is addressed 
in {13! and [24) . 

2.3 Computed Torque from Lyapunov Perspective 

In Section 2.4, we Introduced the total energy Lyapunov function (2.29) to derive a simple set point 
control law. The computed torque method can also be motivated in the same manner with a different 
Lyapunov function. For generality, we will consider the cracking case. Let 

V(aq^.iq^) - i|| 4 q 2 || 2 ♦ j AqJ K p 4q j (2.33) 

Calculate derivative along solution, 

V - iq 2 T (M’ l (q 1 )(-C(q 1 ,q 2 )q 2 - Mq^ + u) - ♦ K p Aq^) 

If the computed torque control is used 

u - k(q t ) + C(q lt q 2 )q 2 + M^Mq^ - U p Aq 1 - K y Aq 2 ) (2.34) 

then 

V 1 - i< l2 T V q 2 

From the same line of reasoning as before, the closed loop system Is globally asymptotically stable. 

However, we know that the closed loop system Is linear, therefore it Is exponentially stable. This means 
that we should look for a better Lyapunov function. An obvious choice Is to add In s cross term In 
(2.33). Then 


V(a qi ,Aq 2 ) - | ||Uq 2 || 2 + J + cdq^dqj (2.35) 

where c is small constant so that V Is positive definite. Take derivative and apply (2.34), 

V - - iq 2 T K v 3q 2 + Ciq^ K4q t * c)|iq 2 || 2 - cflq^ - cflqj K y iq 2 

‘ * ( °nln < *v ) ' C) H 4q 2 li2 ' c4q l T V q l 
which shows closed loop exponential stability. 

Note that In (2.34), In contrast to (2.28), even for set point case, full model nonlinearity 
cancellation Is needed. The approach In this paper is to use the energy Lyapunov function instead of 
(2.33) to generate control laws. We will see In later sections that this affords a much larger 
class of controls which contains much simpler structure In certain esses (especially for set point 
control). 


3. New Results on PD Sec Point Control 


3.1 Simple PD Concrols 

In this section, w * will explore using different U* In the controller desian 
suggested In ( 5 J : 

* 1 Vv q i + * (iq i +q id > - * (q id ) - V k(q id> 
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The following has been 


U*Oq l ) 


( 3 . 1 ) 


If Kp 1* sufficiently large, • 0 is tbs global minimu m of U(Aqj). Hence , s simpler control law esn bs used: 

“ * - V<1 - Vi ♦ k < q ld> (3.2) 

Suppose each joint Is conetralned between joint stops: 

o (t) < < o< h > 

q ll - q li- q li 

and the set point Is In the Interior of the joint Inputs: 

q u q u - q ud l q u q a 


(3.3) 


(3.4) 


Let the desired potential function be 

n 


(3J) 


(3.6) 


U*(A qi ) - \ (H t (4q u ^ nd ) ♦ V^*,^)) 

where and are appropriate upper and lower barrier potential functions for Joint 1 ^ 

Then, “Sj • 0 la a global minlauai of U*(£q^) [23j • Pro* (2.31) 
u - - K y q 2 - IC p Aq l - HfAq^ - L(4q 1 > ♦ Mq^) 

Sl.ll.rly, If K p l. .ufflct.ncly l.rg. (K p ♦ ^ (^ ♦ q u > > 0). th« follovln, control law .1.0 .chl.vo. 
global asymptotic stability: 

u . - K v q 2 - K p 4q l - H^j) - LCAq^ ♦ «R ld > ° <7) 

Control laws (3.2), (3.6) and (3.7) still require Information on the gravity load. It Is Interesting to 
ask If this last piece of model Information can be removed. This case corresponds to the desired potential 
energy 

U.fiqj) • | 4qj T K p 4qj + gUq^j) 

The corresponding control law is 


(3.8 ) 


V q l - K v iq 2 


(3.9 ) 


From section 2.6, 
3U* 


3 Aq 


(Aq,) - K^Aq, + kfAq^q^) 


P H 1 


This implies 


1 insup | | Aq (t) | 1 1 a -ln (K ) »up | 
t— 1 “ l " P q^R" 


k(q 2 ) I 


’k(q,) 


If K + 


p * ”3q * ° * * K^kfAq^+q^) ** a conCracClon ®*P ln Aqj- Then 2 * q^ 


* such that 


V fl l** q ld ) + k<q l* ) * 0 (3.10) 

Urn q t (t) - qj» 

This result suggests a very simple, robust and practical control scheme. The feedback gain K can be chosen 
lar8 fk(qj) 8h tD ^ ustlfy the u#e of PID contro1 li. ch. 6 of 19) which Is locally stable. p Typically, k(q J ) 

and — j<q — are composed of trigonometric functions, therefore, they are uniformly bounded. 

3.2 PD Control with Exponential Convergence Rate 

Uaa of the Invariance Principle in Section 2.4 only shows asymptotic stability. Some guaranteed rate of 
convergence la highly desirable not Just fer performance reasons but for robustness analysis and adaptive 
control also. In Section 2.3, a Lyapunov function with cross term has been used to show exponential 
stability. This suggests a similar modification here. The result is summarized below. 
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Theore* 3-1 


Given tha control lav (2.31) 


- K 


3U* ^ 2U 


(2.31) 


Suppose a V > 0 such that 

<*!» * v H^iM 1 ' «•“> 

and U*(Aq 1 ) haa a global alnlaua at • 0, than tha closad loop (Aq^.q^ syste* la exponentially atabla. 
Proof: 

Modify tha total anargy Lyapunov function (2.29) to 

V - T + U* ♦ cAqj T p+| caqj T 

whara c la a amail constant so that V Is posit iva daflnlta In p and q^. Without loss of generality, U* can 
be considered positive daflnlta In q^ (by adding an appropriate constant). Than fro* (2.32), 

V • ^ P ♦ cV<- % - «£ * “> ♦ « , 2 T V q l 


* q 2 ? K v q 2 * ' «2 T M(q l >q 2 * ' V ^ * C V gf 


Let 


u • sup || M(q.)| | 

q lC R n 1 


'l j| A(j ; | wl * 2 CAq l K v Aq l ~ 2 Cji ^ > ° {or #OB * constant t. 

I V 1 

Uwm 2-1, V V, c (0, o 2 - y (r 2 )!) 

V 1 * ’jiUqjH 2 - > 2 !|4q,|| 2 
" Jq l * 2 l-l * l ,J q 2 ’ 2 M D T(q l’ q 2 )<, 2 


(3.12) 

(3.13) 


Note 


(3.14) 


Then 


-- (o mln <K v> - c “>Mo 2 II‘ - c v I ^ I 2 - f ** M^.q^q, 


Define 

n 

Then 


n 

, - *up l | |M (q ) | | 

q x i-i 1 1 


(3.15) 


VI- •jlliqJI 2 - '.jIUqjll 2 ♦ YjjIliqJI lliq^ 2 


where 


« c ■ 


v (K ) - cu 
min v 


Y 

j 

Choose 


— c n 

2 l 


V i 
o.2. -1 


C ' ’.l„ <%><“♦ J" 1 ^ 


where V - V 

o 


and 


c-0 
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(3.16) 


< - X V 


for iom X > 0. Hence, the closed loop system la axponantlally aeabla. 


I 


Given any U* according co <3.11), R > 0 and Initial condition, there always axiata c that aatlafiea 
(3.16). Even though c la not needed In the implementation, lta maximum allowable alze af facta the 
convergence rate. The artificial potentlala U* • 1/2 AqjT K p Aq^ , (3.1) and (3.5) all aatlafy the 
aaaumptlo no of Theorem 3-1. Therefore, the correapondlng cloaed loop ayotema are exponentially atable. 
for the potential given by (3. 8 ) and Kp Urge enough (JC + 3k > Q) , we can add a conatant to U* so 

P TqT 


that U* la positive definite in qj-q* and (3.11) la satisfied for Aq^qj-qJ. where q^ solves (3.10)* Then 
Theorem 3-1 Implies exponential convergence of q^ to qj which is within 0 n i n (Kp) »up 1 1 k(q^) | | from the true qj^. 


4. Hew Results in Tracking Control 
4.1 Exponentially Stable Algorithms 

Frequently a robot is required to follow a preapecifled path for continuous action at the end effector 
(e.g., arc welding), tracking of a moving target (e.g. pick and place operation from conveyer belt) or 
ocher high level objectives (e.g., time optimality, collision avoidance, arm singularity avoidance, etc.). 
This can be posed as the problem of tracking the desired positions and velocities (q^^ • 92d) b y In 

this section, we will extend the basic Ideas put forth In Section 3 to the tracking problem. The error 
equation is now in the form 

Aq t - Aa 2 

M(qj)iq 2 • - C( qi ,q 2 )q, - k^) . u - fUq^q^ <*-l> 

We will first state several direct generalizations of Theorem 3-1. An energy type Lyapunov function 
similar co (2.33) uoed in Section 2.5 to motivate computed torque is used here. 

Theorem 4-1 


Consider (4.1) with the control law 

“ • - V« 2 + + M(q i )q 2d ' 0<, r q 2 ,q 2d ) 

where D Is given by any one of the following expressions 


D(q 1 .q 2 .q 2d ) * J <J<q 2 .q 2 >q 2 d ' * q 2 d )q 2 ) 

° <q l ,<, 2 ,q 2d > ' I (J(q l’ q 2d )q 2 * M D (q f q 2 )q 2d ) 
D(qj.q 2 .q 2d ) - 7 (J( V q 2 d )q 2 d " *V q l ,q 2 ,q 2d* 

D(q r q 2- q 2d ) - 7 (J(q 1 ,q 2 )q 2 - 
Assume 3u > 0 such that 

iq i T 51^ (iq i } * u H q lH 2 

and U*(Aq^) Is positive definite in^q^. Then the null 
exponentially stable equilibrium. 

Proof : Use the following Lyapunov function 


(4.2a) 
(4.2b) 
(4.2c) 
(4. 2d) 

(4.3) 

state of the Ciq^.iq^ system is a globally 


V(-q 


r ^q 2 ) * iq 2 M( qi )iq 2 + UMAq^ * c 


M ( q x ) Aq 2 + j C 


‘^1 V <1 


(4.4) 


where c is a small constant, such that V is positive definite in Aq. and Aq,,. 
solution: A 


DU* 


V (A qi ,iq 2 ) - Aq 2 (M(q 1 )Aq 2 + j H D ^i*^ 2 )q 2 + DA^j” + c 

+ c K v i qi ) + C i qi T (M( qi )ii 2 + M Q (q l ,iq 2 )q 2 ) 
Substitute (4.1) and ( 4 . 2 ) and use (2.7) 


Take derivative along 
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V(Aq 1> &q 2 ) ■ Aq 2 T (-C(q 1 .q2) , »2 ” + u " ^l^d + 2 M D^ q l' 4q 2^ <, 2 


+ (4q x ) + c M(q 1 )4q 2 + e VV 

+ e 4q 1 T (-C(q l .q 2 )q 2 - Mq^ + « * + ^(<1 , 4q 2 )q 2 ) 

• r(q 1( q 2 .q 2d ) - 4q 2 T (K v - cM(q 1 ))4q 2 - 4q 2 T 0(q lf q 2 ,q 2d ) 

+ c4q 1 T (M p (q 1 ,4q 2 )q 2 - 

- C *1 SS^ (6q l> * ct *i D <**1 ’**2 *^2d^ 

Apply Identity 5, t - 4q 2 T D ■ 0. D«£ln« r^. Pj •* 

n, ■ *up l I |Mj(q. ) 1 1 
1 q^R" i-i 

n, - »up llq 2d IK 

I2d 

From Identity 6, 

|caq^(Mjj(qii ^2^2 * ~ D ^1 #< *2 *^2d^ I 

< c||4, 1 ||<«n 2 ||Aq 2 || I |m 2 i | 2) 

where »-3 f or (4.2*) , *-| for (4.2b) , for (4.2c) , «-j for (4. 2d) 


V(4q lt 4q a ) < - (^(K,) - <*) ||4q 2 || 2 - c v | 1 4q x | |* 


+ c | |4q 1 ||(*n 2 ||4q 2 H +j^||4q 2 || 2 ) 


Completing square for the cross term, 

V(Aq x ,Aq 2 ) 1 - a L | | Aq L | | 2 - a 2 | |Aq 2 | | 2 + y 21 I | Aq x | | | 1 Aq 2 | [ 2 


, 1 2. 

* c(v - j a n 2 o ) 

1 an 2 

“2 ' °»ln ( V * c <“ + 7— 5 

P 

Y 21 m 2 C °1 

2 2 °1 

Given j, choose p < . By Lemma 2-1, for 

an 2 

i an- i V l . 

‘<<WV (u + i -r + K ( r )2) 

o i 


(V , L are as defined from the proof of Theorem 3-1) and V X- c (0,a. - y 2 , (t 2 ) ) 

o 1 — — — * '•j 

V < - ajldqjl 2 - X 2 l|Aq 2 U 2 


for some X > 0. Hence, the closed loop system is exponentially stable. 

A common Lyapunov function used for tracking problem has been [9, 10] 
V(i qi .Aq 2 ) - J Aq 2 T Mfq^Aqj + UMAq^ 
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In this case, a generalization of Invarlanca Principle to time-varying case la required. There are two 
possibilities. The result In (Theorem A. 7. 6, 21] appears promising* but we must verify that (4.1) la positive pre- 
compact (in the sence defined in [21]). A more direct route is to use [Leona 1, 22] which states that if 
A$2 *nd Aq2 * c * *>° ch bounded uniformly in t (It follows from V < 0), then Aq 2 (t) * 0 implies Aq 2 (t) *» 0. 

Note that U*(Aq^) does not depend on time explicitly. This restriction eliminates some of the 
candidates used in set point case. How to generalize to the case of U*(Aq^,t) and SU* (Aq ,t) not 
necessarily negative semideflnlte is currently under investigation. $ t 

Control laws (4.2a-d) all have same stability property nominally. When q 2 is a very noise measurement, 
as la typically the case, (4.2c) which only uses q 2 once may have better robustness. 


Note that all the controllers have structures very similar to computer torque; in fact, if all occurrences 
of q 2d are replaced by q 2 , then the nonlinear compensation is exactly the sane as the case of computer torque. 
However, in their present forms, (4.2a-d) cannot take advantage of well known recursive algorithms for inverse 
dynamics computation (11, 12]. Therefore, we next present slightly modified versions that can be Implemented 
with these algorithms. 

Corollary 4-1 

Consider (4.1) with the control law 

u - - KyAq 2 + k(q x ) - (Aq x ) + ,q 2d )q 2d (4.6) 

satisfies the same assumptions as In Theorem 4-1. 


state of the (Aq^.Aq^) system la a globally exponentially stable equilibrium. 


(4.1) with the control law 

U • - * v iq 2 + Mq^) - (a q r ) + MCq^q^ + CCq^q^qj (4.7) 

where U*(Aq^) satisfies the assumptions as In Theorem 4-1. Given a set of possible Initial conditions, if 
Ky is sufficiently large, then the closed loop system is exponentially stable with respect to chat sec. 

1 T 

If U* (Aq^) * j Aq^ K Aq^, ( 4 * 7 ) is actually a modification of the computed torque method with K p , 

Ky replacing fUq^Kp, MCq^Ky. 

So far we have generated many control laws that are similar to computed torque. However, the ones 
just requiring > 0, v > 0 are not easily implementable, and the easily implementable ones need stronger 
conditions (Ky sufficiently large). The next control law that we shall present is of very appealing 
structure: the real time undate comoutatlons are linear and the off-line computation can take 

advantage of efficient algorithms (e.g. v Newton-Euler type). The trade-off is chat K v and v must both 
be large enough for a given set of initial conditions. 

Theorem 4-2 


where U^fAq^) 
If 




then the null 
Corollary 4-2 
Consider 


Consider (4.1) with the control law 

u - - ♦ k(q ld > * fj^ <6q t ) ♦ + C(<1 ld * <, 2d )<l 2d (4 ‘ 8) 

where U*(Aq|) satisfies the assumptions as in Theorem 4.1. Given a set of possible initial conditions, if Ky 
and v, are sufficiently large, then the closed loop system is exponentially stable with respect to 
that set. 

Typically, q 2 (0) - q 2 d(0) * 0 and Aq^(0) is always within 2w. Hence V Q is bounded above, and the result Is 
essentially a global one. This scheme requires both v and K v large enough. This requirement is made easier 
by shifting the computational burden to offline thus allowing very high sampling rates which in turn means 
high gains can be tolerated. 

4.2 Robustness 

Lyapunov analysis provides a useful approach to study the robustness Issue. Robustness Is a much abused 
word in the literature. Here, we use insensitive design to mean preservation of stability (in the sense of 
Lagrange) under sufficiently small perturbations. Furthermore, the size of the ultimate bound should vary 
continuously with the size of perturbations. By robust design we mean a controller design that preserves 
stability under prescribed size of perturbations. In this section, we will examine frictions, both viscous 
and Coulomb type, bounded modeling error and bounded actuator and sensor noises. 
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Friction forces can be approximately modeled by Coulomb friction, or dry friction, and vlacoua friction 
due to oil lubrication. For joint 1, the frictional force la given by 

f me • - r n •«" <«ai> - r 2t "21 • • r il* F 21 * 0 (4<9) 

Equation (4.1) la then 

HCq^Aqj • - C(q!»q2 )q 2 * k(q l ) + u " M <qi^2d “ P 1 “ F 2 q 2 

where ?i and F 2 «r« diagonal matrices with elements F u , P 2l . respectively, agn(q 2 ) represents a vector 
with elements *gn(q 2i ). 

From [23], the set point controller la both Insensitive and robust. The tracking controllers arc also Insensitive 
with respect to frictions. For robust design for a given level of friction, *^0^) * nd v should be chosen large 
enough. s 

Next we consider modeling error In controller Implementation. Model parameters k, M, M D can all be 
written linearly In constant parameters. Assume bounded errors are Incurred in these parameters. 

Control laws (4.2) are in Che form 

u " • V q 2 ' (4q l> + k<q l ) * K(q l )q 2d " D(q l’ q 2' q 2d ) + 4 1 + 4 2 

The additional terms in V are 

- (CAq l + dq 2 ) T (4j + 4j) 

which, aftar overbounding [23], becomes 


<c | | Aq x j I + I I Aq 2 | |) ^ 1 ^ 2 ^ * \ ll^ll*) 

After completing squares, the overbound over the extra terms In V become 

c 


+4 ) 

! ( Vj + _ rr _) "V 


+ (4 +-2y + -l-| ) 

2 Y 2 Y 


! Aq- 


♦ C«, 


llAqJI 


+ 


lAq 2 | i 3 ♦ \ (ca x 2 + s 2 2 ) 


For sufficiently small. 3^, >0 such that 

v I - Oj I |4q t l I 2 - » 2 I 1^2 I I 2 + c(^ + 4 4 ) | |4q 1 I I I I ^ | 1 2 + « 4 I 1 ^ | | 3 ♦ 0 

By Lemma 2-2, If V Q >0 and <^ t 5j are small, the system remains Lagrange stable and the ultimate bound 
vanishes as ^ , $ 5 *► 0. Hence, the design Is insensitive with respect to modeling errors. For robust 
design for a given level of modeling uncertainties, a min(K v ) and v should be large enough. 

Finally, suppose bounded errors Cp e 2 an ^ c 3 are incurred in qp q 2 and respec t ively , Control 
laws (4.2) are now in the form 


u * - \ ^2 ‘ (i V + k(q l ) + M( V q 2d ■ D(q l ,q 2 ,q 2d ) + 4 1 + *2 

Follow the sane steps as before, we overbound Che extra tarns In V by suns of ||Aq,|| 2 , ||Aq 2 || 2 , ||Aq, || ||Aq_ || 2 
and constant terms. Again use Lemma 2-2 to conclude that the controller is Insensitive to bounded instrument 

noises and robust for a given level of noise if 5 . (K ) and v are sufficiently large. 

min v 

As an aside, it should be noted that similar results as derived here hold for computed torque 
techniques also. For the case of set point control under friction, the insensitivity and robustness 
properties of the design here do not follow directly from the analysis. 

There are obviously many more practical implementation Issues not addressed here: sampling effect, 

actuator saturation. Joint and arm flexibilities and instrument dynamics. They are currently under 
investigation in the same framework. 

5. Summary 

We have Introduced a new class of exponentially stabilizing control laws for the joint level control of 
robot manipulators (summarized in Table I). The stability result is achieved by making use of a particular 
class of energy-like Lyapunov functions (of the form (4.4)) In conjunction with a useful lemma (Lemma 2-1) for 
addressing higher order terms In Lyapunov function derivatives. This approach avoids the need for a generali- 
zation of the invariance principle to time-varying systems, which has been the major source of difficulty in 
the past (9,101. 

In the set point control case, by Incorporating artificial potential fields in the Lyapunov function, we have 
derived a class of exponentially stabilizing, PD + potential shaping type of control laws. Several useful 
potential fields have been examined resulting in simple structures: PD and PD+bias, and the ability to handle 
joint stop constraints with PD + joint-stop-barrler controller. 
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In th« tracking control case, the modified Lyapunov function leads to a new class of exponentially stable 
control lava. This class of control laws offers an alternative to the conventional computed torque method and 
provides trade-offs between on-line computation (which directly relates to performance through maximum sampling 
rate) and condition for stability* In one new design, (4.8), the nonlinear structure is decoupled from the 
real-time measurements which completely removes the requirement for on-line nonlinear computation. The chart 
below illustrates the trade-offs in the various tracking control laws. 


On-line 
Computat Ion 
Load 


(4.2a) 
(4.2b) 
(4.2c) 
(4. 2d) 

computed 

torque 


(4.6) 


(4.7) 


(4.8) 


v.K v 0 
for any 
initial 
condition 


v>0 

K v large enough 
(Independent of 
Initial condition 


large enough 
for a given set 
of Initial 
cond it ions 


large enough 
for a given set 
of Initial 
cond itions 


The framework of Lyapunov stability analysis also allows robustness issues to be directly addressed. 
Specifically, insensitivity property (preservation of stability under small perturbation) and condition for 
robust design (preservation of stability under a specified amount of perturbation) fer this new class af 
controllers have been derived with respect to viscous and Coulomb friction, modeling error and bounded 
Instrument noise. 

The new stability analysis and controller design teqhniques presented in this paper open up many promising 
avenues for future research. In particular, our current research directions include: ways to Incorporate tine- 

varying artificial potential fields in the tracking problem and the generalization of the exponentially 
stabilizing Joint-level control laws to the task space. 
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Simple Robust Control Laws for Robot 

Part II: Adaptive Case 

D.S. Bayard and J.T. Wen 
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California Institute of Technology 
Pasadena, CA 91109 

Abstract 


Manipulators, 

7J * — 


A new class of asymptotically stable adaptive control laws is introduced for application to 
the robotic manipulator. Unlike moat applications of adaptive control theory to robotic 
manipulators, this analysis addresses the nonlinear dynamics directly without approximation, 
linearization, or ad-hoc assumptions, and utilizes a parameterization based on physical (time- 
invariant) quantities. This approach is made possible by using energy-like Lyapunov functions 
which retain the nonlinear character and structure of the dynamics, rather than staple 
quadratic forms which are ubiquitous to the adaptive control literature, and which have bound 
the theory tightly to linear systems with unknown parameters. It is a unique feature of 
these results that the adaptive forms arise by straightforward certainty equivalence adaptation 
of their nonadaptive counterparts found in the companion to this paper (i.e., by replacing 
unknown quantities by their estimates) and that this simple approach leads to asymptotically 
stable closed-Loop adaptive systems. Furthermore, it is emphasized that this approach doe3 not 
require convergence of the parameter estimates (i.e., via persistent excitation), invertlbility 
of the mass matrix estimate, or measurement of the Joint accelerations. 


1. Introduction 

In past years, many papers have appeared on the application of adaptive control theory to robotic 
manipulators (cf., [2]-[7], and Hsia [8] for overviewVT It is a general property of adaptive designs 
based on Lyapunov's Direct Method, that the Lyapunov /function is chosen as a simple quadratic type, well- 
known and well studied in the standard adaptive con/rol literature [12] [13]. However, this particular 
Lyapunov function was originally motivated for applications to the standard adaptive control problems 
(i.e., linear systems with unknown parameters), and not for nonlinear dynamical systems. Hence, applications 
of standard adaptive control techniques to robotic manipulators invariably require rhe dynamics to be considered 
as linear. This in turn, requires the use of ad-hoc assumptions and/or analysis techniques including 
1) treatment of position dependent quantities fis unknown constants, for which they must be assumed to vary 
slowly with time; 2) linearization of the system about some local operating point-valid only for small 
excursions from nominal; 3) the use of linear decoupled models for the links, which neglects nonlinearities 
and crosscoupling effects; and 4) neglecting the nonlinear and time-varying dynamics completely by assuming 
the plant la linear. Hence, stability results based on these assumptions are questionable, and a rigorous 
proof of stability for adaptive control of robotic manipulators remains unresolved. 

A recent exception to the above criticism is due to the work of Craig, Hsu and Sastry [9]. Here, a 
useful “linear in the parameters" formulation is exploited to simplify the analysis, and to demonstrate global 
convergence of an adaptive version of the computed-torque control law - without approximation to the nonlinear 
dynamics. However, the resulting adaptive controller requires the invertlbility of the mass matrix estimate 
(which is not guaranteed a-priori), and measurement of the Joint accelerations (which is generally unavailable). 

It is suggested In (9J, that the former can be handled by projecting parameter estimates into known regions of 
parameter space for which the mass matrix Inverse exists, and in which the true parameters are required to lie. 
However, knowledge and calculation of such regions is not straightforward and appears to be a weakness of the 
method. 

In this paper, the "linear in parameters" formulation of [9] is used in conjunction with a different 
Lvapunov function. Here, the choice of Lyapunov function is more closely related to the energy of the system, 
aud better retains the nonlinear structure and character of the dynamics. In addition, many problems associates 
with adapting the computed-torque control law directly are avoided by making use of the new class of exponentially 
stabilizing controllers introduced in [1]. Although these controllers are very similar in form to the computed 
torque method, they have many advantages in the nonadaptive case (cf., [1]), and have the unique property that 
they can be made adaptive by using a straightforward certainty equivalence approach (i.e., by replacing unknown 
quantities by their on-line estimates). Furthermore, the class of adaptive systems defined in this manner 
can be shown to be asymptotically stable without approximation to the nonlinear manipulator dynamics. This 
approach does not require convergence of parameter estimates (i.e., via persistent excitation), invertlbility 
of the mass matrix estimate, or measurement of joint accelerations. 

In the most recent literature (i.e., preprints, conference papers, etc.) there appears to be other work 
currently taking place which combines the linear in parameters formulation with a new Lyapunov function [10], 

[11]. Although this work is very new and is evolving very rapidly, we will try to contrast our results where 
possible, and provide an overall perspective. 
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The format of the paper la at followa. In Sac. 2 the raaulta of [1] ara reviewed and summarized aa 
required for treatment of tha adaptive control case In Sec. 3, asymptotic stability ia proved for the claaa 
of lyactni arising from certainty equivalence adaptation of tha control lava in [lj. Slightly tangential to 
the main thruat of the paper la tha analyala in Sec. 4 of the adaptive computad torque mathod. Since 
the coaputed- torque control lav ia widely establlehed In the literature, and widely applied in practlcm. 

It la uaeful to apply the tcchnlquaa developed herein to ace to what extent it can be aade adaptive and to what 
extent a tab ill ty can be guaranteed. In Sec. 5, aeveral renarka are aade pertinent to the new adaptive designs, 
and conclualona are given in Sec. 6. 

2. Background and Notation 
2.1 Manipulator Dynaaica 

The well-known Lagrange-Euler equations of notion for the n-joint aanlpulator la given as follows. 


M(q 1 )q 2 - - C(q lt q ? ) - k(q 1 ) + u 


l* M 2 y 


where 


C(q l’ q 2 ) " l(e i q 2 T V q l ))T ' 1 (, i q 2 T J 

^ unit vector 

3M(q i> a th 

Vi* “ 1^— - : *>ii “ 1 component of q^ 


k(q^) • gravity load 


( 2 . 1 ) 

( 2 . 2 ) 


(2.3) 


Here, ucR n la a generalized torque vector, q^, qj, q 2 cR n are generalized Joint position, velocity and 
acceleration vector, (e.g., ia an angle or a distance for a revolute or prismatic Joint, respectively, 
M(qi)cR nxn is the syraaetrlc positive definite mass inertia matrix; C(qj,q 2 )cR n la the Coriolis and centri- 
fugal force vector; and k(q^)cR n is the gravitational load vector. 


2.2 Some Useful Identities 


Let the following notations be detlned, 

W* 5 - I K^)* ej 

i"l n 

M( q i. q 2 ) - 57 M( qi ) \\ 

J(q 1 ,z) - l [(e 1 z T M 1 (q 1 ) - (e 1 z T M 1 <q 1 ) ) T ) 


N ' q i - q id * *2 ' q 2 - q 2d 


q ld* q 2d " desire<1 J OAnc position and velocities respectively (q^ ■ q^) 


T f l 


rCqj^.qj.q^) • dq 2 [j M(q 1> q 2 )Aq 2 - C(q 1> q 2 )q 2 ) 


Using the above notation, the following Identities are quoted from [i] without proof. In these 
identities, x, y and z are used to denote arbitrary vectors of appropriate dimension, 

Identity 1 

M ^l'^2^* " ^D^l* 2 ^ whjer « vector z la arbitrary 


Identity 2 

C(q r z)z - y (M D (q 1 .z) - J(q r z))z 
Identity 3 

Jtqj.z) * M D T (q 1 ,z) - M I) (q 1 ,z) 
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2.3 Important Leon* 

In this section, a useful l«m le reviewed, quoted directly without proof fro* fl]. For convenience, 
thle result will be alternstively referred to ee the g-Ball Tews due to the aethod used to prove it. 

Ltnu 2-1 (0-Bell Lease) 

4 

Given e dynamical systea 

*1 * f i* x r*“ x lT t * * ^t*" 1 • 1 i° 

Let fj's be locally Llpschltx with respect to X},...,x H uniformly In t on bounded intervals and continuous 
in t tor t > 0. Suppose a function V:R n l x **' xn N x * *+ is given such that 

r T 

V(x 1 ,..., V t) - ^ l x t P 1J (x 1 X J ,t)x J , 

V is positive definite in % 1 x^ 

*<«! V° - ' I (o i - I > lj ll* i (‘)|| k «)||x 1 (t)|| 2 (2-*) 

UI 1 J el 2 t 


wh«r« ® 1# Y t j, > 0, «• U. 

Let > 0 be such that. 


e t l l x J I 2 i v< Xl x N ,t) 

ut v o 2 v(x, (o) yo.o) 


If Vic^, 


i * l Vf) 

JeI 2l J 3 


V ^ 

Ox 2 


then V X 


v Y 


c(0, a - I y <t*) 2 ) 

JtI 2i 3 3 

V(*, V e> I - I x 1 1* | 

A " iC^ * 1 




V t > 0 


(2.5) 


( 2 . 6 ) 


2.4 Exponentially Stabilizing Control Laws 

In [1], various new exponentially stabilizing compensators were Introduced for both the set-point and 
tracking control problems. For the purposes of adaptive control, it is of interest to consider the subset of 
this class summarized In Table I. In addition, the well-known computed torque control has also been Included 
in Table I for comparison purposes. It is noted chat the desired potential field U*(Aq^) used in [1] has been 
chosen here simply as, 

U*(a<) 1 ) - Y Aqj 1 Kp^. <2. 7) 

so as not to obscure the presentation with additional obstacle avoidance objectives. Nevertheless, many of 
the adaptive control results presented herein are easily extended to the more general case. 

It is useful to observe that all Control Lavs 1-7 differ from the computed torque method in that the mass 
matrix M(q^) does not premult iply the position and velocity feedback galna K. and respectively. This 
property Is critical since It renders this entire class of control laws amenable to simple adaptation schemes 
(i.e., certainty equivalence adaptation) which can be shown to lead to desired asymptotic stability properties. 
The presence of the mass matrix premultiplier otherwise prevents simple cancellations In the Lyapunov function 
derivative, hindering most attempts to apply adaptive control directly to the nonlinear dynamic manipulator 
equations. A recent exception to this can be found in the work of Craig, Hsu and Sastry [9]. 

However, the resulting adaptation law requires that ths estimated mass matrix be invertible for all values of 
estimated parameters. This in turn requires on-line projections of parameter estimates Into prespecified 
bounded regions of parameter space where M(qj) is not only invertible, but where the true parameters are 
certain to lie. This approach not only requires tight bounds on parameter uncertainty, but involves a very 
difficult (al belt off-line) determination of the proper parameter projection domains. This problem is 
further exacerbated by the fact that the adaptation law is not parameterized by physical parameters and Is 
of the form where the transformation back to physical parameters is neither straightforward or unique. These 
problems are overcome in this paper by using the exponentially stabilizing control laws of Table I, which do 
not involve a premultiplying mass matrix on the feedback gains. 
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TABLE I 


COMPUTED TORQUE CONTROL LAW 


CONDITIONS FOR STABILITY + 

CROSS-REFERENCE 
TO (I]* AND (20| 

u * 

■ - H< qi )oyq, 

♦ tt y Aq 2 > + ktqj) ♦ ♦ C(qj.q 2 )qj 

None required 

(2.34) 

NEW 

EXPONENTIALLY 

STABLE CONTROL LAWS 




1. 

u - - K/q, - 

V 2 ♦ "‘V + M(a l )a 2d 

- $ J(qj.q 2 )q 2d ♦ 5 VV q 2d )q 2 

None required 

(4.2s) 

2. 

u • - *^<1, * 

* H(, i )q 2d * 

2 J( V , 2d >, 2 * 2 VV q 2 )q 2d 

None Required 

(4.2b) 

3. 

u • - K p A qi - 

Vq 2 ♦ k< V * M( V q 2d 

- Y J< V a 2d )q 2d * * VV q 2 ,q 2d 

None Required 

(4.2c) 

4. 

. - - K p iq 1 - 

V a 2 * k <V + H< V q 2d 

- } J( qi .q 2 )q 2 ♦ 1 VV q 2d )q 2 

None Required 

(4. 2d) 

5. 

* • - V q i • 

V q 2 * k(a l ) * M(, l >a 2d 

* C( V q 2d >q 2d 

WV * r 

(4.6) 

6. 

» - - V q ! * 

V 2 * k( V + H( V a 2d 

♦ C(q r q 2 )q 2 

a sin**v* Bufflcl * ntl y l * r R* 
w.r.t. Initial condition 

(4.7) 

7. 

u . - K p iq l - 

X y iq 2 ♦ k(q ld ) ♦ H(q u )q 

2d * C(q ld ,q 2d )q 2d 

o J (K ) sufficiently large 
■ in v 

w.r.t. Initiel condition 

(4.8) 


^General Aaiunpcloni lh 2 d ll. I I I I boundtd ; k y * * °* K p " * p T * 0 

*Ut UMAdj) - j Aq^ K p A qi in (1| 


In the nonadaptlve case, comparisons between chs ntv control laws of Tabls I and ths computsd tor qua 
msthod can be found in (!]. Nevertheless, a brief account Is in order here. In particular. Control Lews 1, 
2,3,4 are roughly "on par" with the computed torque method in the nonadaptlve case, guaranteeing exponential 
stability with no conditions on Kp or K v . Unlike the computed torque method, however, they are not in s form 
suitable for application of the recursive Newton-Euler computation technique. This presently appears to be 
their major disadvantage. In order to overcome this difficulty. Control Laws 5, 6 and 7 were developed 
in a form suitable for recursive Newton-Euler computation. Relative to the computed torque method. Control 
Law 5 utilizes the desired velocity signal q 2d in place of the measured velocity q2 In the nonlinear terms of 
the controller. This "cleans up" the feedback signal in the sense that nonidealities due to sensor dynamics 
and measurement noise in q 2 are avoided in the nonlinear feedback terms. Control Law 7 further replaces q^ 

In K, M and C by qj<j. This decouples the nonlinear terms from real-time measurements, which completely 
removes the requirement for on-line computation of nonlinear terms in the controller implementation. Control 
Law 6 is exactly the computed torque method without the premultiplying mass matrix term described earlier. 

The advantages of these controllers are off-set slightly by the conditions imposed on Kp and K v fo* guaranteeing 
asymptotic stability i.e., that K v be chosen sufficiently large for Control Laws 1, 2, 3, 4, 5, 6 and that both 
K v and K p be chosen sufficiently large for Control Law 7. It will be seen in the adaptive case that these 
requirements can be removed by adapting these feedback gains appropriately. 

The use of q2<j rather than q 2 in many of the new control laws offers additional advantages. In particular, 
in the set-point control application q2d"^2d“ 0 ' Henc *» there is considerable simplification in the control 
laws relative to the computed torque method, i.e., the nonllneer terms vanish from the control law. This 
simplification carries over directly to the adaptive case and provides substantial simplification in set-point 
control relative to the recent adaptive control laws of Slotlne and Li (11] and Paden(10J. 


3. A New Class of Asymptotically Stable Adaptive Control Laws 

All of the new exponentially stabilizing control laws suuaarlzed in Table I have the unique 
property that can be adapted in real-time so as to yield asymptotically stable adaptive control systems. 
Furthermore, the adaptation is done in a certainty equivalence fashion, I.e., by simply replacing 
unknown quantities in the control laws by their estimates - as generated by an appropriate parameter 
adaptation algorithm. In this section, asymptotic stability for ths various control lavs will be 
proved, and the proper mechanisms for parameter adaptation will be derived. 
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The simplicity la ttructurm of the adaptive control schemes presented horo la largely duo to a 
*1 inoar In tha parameters" formulation of tha problem. This particular paraaatarlsatlon la bacoalng 
Increasingly popular In racant lltaratura (cf., [9] (10] (16) [19D and will bo dlacuaaad In nora detail 
bo low. 

3.1 Llnaar In tha Paraaatara Formal at ion 

A uaaful paraaatarlsatlon of tha nonlinear dynamical aquation* arlaoa by noting tha following 
relatione (x, y and t arbitrary vectors), 

C(x,y,)y • H (x,y)e 
C C 

M(x)s • H M (x,s)e H 
k(x) * 

V x>y)y - HpCx.yJdjj 

where He, Hr, H* and Hj> are known matrix valued functions of x, y and s, and where « c , 0 m* ®k» ®D 
vectors of constant parameter* related directly to true physical paraaatara (massej, Inertias, link 
length*, center of gravltlea, etc.). It la emphasised that this paraaatarlsatlon does not contain 
any hidden "slowly varying" states fn tha parameter vector definition and does not require any 
linearisation of tha dynaalcal aquations of aotlon. 

3.2 Clobal Asymptotic Stability for Adaptation of Control Lavs 1, 2, 3, 4 


In this section, global asymptotic stability Is proved for adaptation of Control Laws 1, 2, 3 and 4. 

In order to avoid redundant analysis, tha details of tha proof will ba considered only for Control Lav 1, 
and the extension to tha other control laws will follow immediately by taking advantage of the unified 
treatment of these control laws given In (1]. 

3.2.1 Asymptotic Stability 

Consider Control Lav 1, 

u°- - K p 4vV<2 * ' I J( V q 2 )q 2d I W^d 1 ^ (31) 

Here, superscript la used to denote the ideal nonadaptlve control lav, l.e., the completely "tuned" 
control law which would be used if the parameters were known exactly. Using the linear in the parameters 
formulation discussed in Sec. 3.1 there exists a matrix H 1 (qj, q2, q2d» 92d> and * v<ccor °* parameters 
8 such Chat, 

u°- - K p A<l 1 - r v A<, 2 + H t 8 (3.2) 


where 


V * "<1x>i 2 d - 1 J(q l* q 2 ), '2d + 1 M D (<, l ,q 2d )<, 2 ° >3> 

Here, the parameters in ® are constant with time and are related directly to physical link and payload 
parameters . When these parameters are unknown, the parameter vector & is replaced by Its estimate 
8(t) In real-time to give the following adaptive control law, 

u • - K - K,Aq 2 * P- 4 > 

Subtracting (3.2) froa (3.4) and rearranging glv.a 

u • u° + Hj ( 8 - 0 ) - u° + Hj* ( 3 . 5 ) 

This is an Important relation alnce it shows that the adaptive control la equal to the nonadaptlve control 
plus an expression which is linear in the parameter error * * 0-8. 

The proof of stability then followa by choosing the following Lyapunov function, 

V - V° + J * T 1*4 r - r T > 0 ( 3 . 6 a) 

T 

where V° is the Lyapunov function for the nonadaptlve control law used In [1], and where ♦ T* Is a positive 
definite function in the parameter error > ♦ For completeness, V° Is rewritten here (cf., [1],(4.4) where 

U*(Aq,) - J Aq, T K^Aq,), 


235 



(3.6b) 


v ° ■ I S* + i \ T + c V "(VS 


Taking tha derivative of V along ays t am t raj act or la a and aubatltutlng control lav (3.3) given upon 
rearranging, 

V - V° ♦ (6q 2 ♦ ci^ l ) T Hj 6 + i T l> ( 

where V° la tha Lyapunov function darlvatlva for tha nonadaptlvo cane, and- where the additional tarva 
Involving 4 on tha right hand alda of (3.7) arlaa dlractly fron tha additional tarva Involving 4 In tha 
control lav (3.3) and tha Lyapunov function (3.6) raapactlvaly. 

Tha aacond and third tarva of (3.7) ara cancelled exactly by tha choice of adaptation lav. 


i - e - - r _1 B 1 T (6q 2 ♦ c4q x ) 


Tha .xprasalon (or tha raaalnlng t.ra V° 1. alaply takan (roa (1) aa, (aaa (1), (4.5) vhara v ft o >ln (Kp), 
also note that Control Lav 1 correaponda to caae (4.2b) for which a * 2 ) 

2 


« X IKJI 2 - « 2 l|4q 2 H 2 + T 21 IN 11 HMjH 2 


(3>il 

°2 *WV (3 - 1( 

Y 21 • f \ (31( 

h, “ aa* I lq,.l |h (3.1) 

2 ’2d 2d 1 

h, ■ aa* (J | |n (q )| |) (3.1: 

1 ’1 1 11 

w - aas ||M(q )|| (3 . 1J 

q l 2 

0 < c < 1 arbitrary 
o* arbitrary 

t* arbitrary 

Applying tha 8-ball arxuaant of lama 2.1 to (3.9) ualng the valuea of at, a?, and y-m given in 
(3.10), It followa that If, 


. 3 3 

a . (K ) > t n- o 
•in p 4 2 


, n, n, V -r 

WV * e <- ♦ r-f ♦ t *r> > 
0 1 


»:■ ‘illiojll 2 - * 2 IIa« 2 II 2 


for any X ^ and \ 2 auch that. 


X lt (°. c(a atn « p ) -|n 2 p 2 )) 1 

x 2 =»• WV- c( ^ : i + r <r> J > 

0 1 


V o • V | 


C 1 • 7 t 0 -in% + <V - t2 c 2< M >) 
, 2 . i a . £_) a 00 

o (M) ft Bin o (M(q. ) ) 

“ ln 1 
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Since p Is Arbitrary, It can bs chosen sufficiently small so that (3.12a) la satisfied. Vlth this 
choice of P 2 , the value of c in (3.12b) can be chosen sufficiently nail so that Inequality (3.12b) Is 
satisfied. Hence (3.13) follows. This Is essentially the sane proof of stability as In the nooadsptlve 
case (c.f., (1] Theorem 4-1) with the following exceptions, 

1) The value of V Q In (3.12b) and (3.14b) now Includes the Initial parameter error j 4 T (0)r+(0). 

2) The value of c Is now required for Implementation of the parameter adaptation law (3.S). 

3) ♦ In (3.13) Is now only negative senldeflnlte In the state since the full state vector in the 
adaptive case Is augmented by 4. 

It Is noted that proporty 3 destroys the simple exponential stability ariument used earlier In the 
nonadaptlve case (cf., (lj, Theorem 4-1) to Insure asymptotic convergence ofllAqJI and I|AqJ|. In addition 
since tbs error eystem In (Aqt, Aqj»4) Is oonautooooous (and In general, nonperiodic), standard Invariance 
principles are not applicable. Alternatively, we make use of a leaa due originally to 
larbalat 9 quoted without proof from Fopov [14] (pg. 211^. 

Immt 3-1 (Barbalat) 

If V Is a real function of the real variable t, defined and uniformly continuous for t > 0 and if the 
limit of the integral ^ 

11m f W(t*)dt' , 

t-M * O 

exists and la a finite number, then 

11a V(t) - 0 . ■ 

t— 

For our purposes let, 

V(t) $ X l ||4q 1 (C)|| 2 + X 2 ||4q 2 (t)|| 2 
so that 

V<-w (3-^7) 

Integrating both sides of (3.17) from 0 to t, yields upon rearranging, 

, t 

«Jt' < » o • V(t) (3.18) 

o 

Since V 0 Is bounded, and V(t) Is nonincressing and bounded below, It follows that 

11a f W dt' < • 
t— o 

Also, since W is bounded, W(t) is uniformly continuous. Hence, application of Barbalat’s lam a gives, 

11a W - 0 (3*19) 

or equivalently | | 1 1 -^0 and | | 1 I 

This completes the proof of asymptotic stability. The proof, however, is not a global one due to 
property 2, l.e., the value of c which was not required in the nonadaptlve case now appears in the 
parameter adaptation law (3.8). Hence, one Is cOMltted to choosing a particular value of c in the 
adaptive implementation. Of course, c can always be chosen sufficiently small to satisfy the requirement, 
however, the position tracking performance determined by the magnitude oZ lj in (3.14a) must be compromised 
as a result. Bence In practice, the initial choice of c can be made using whatever bounds on ni» 02« u» 
o(M) and V 0 are available a-prlorl, and the value of c can be Improved (increased) on-line ss more information 
becomes available. It Is noted that (3.16a) and (3.16b) impose additional constraints on how large c can 
become, since it is required chat > 0 and > 0 for * positive definite V (these conditions can be shown 
sufficient) . 

The asymptotic stability proof presented above for adaptation of Control Law 1, is easily extended to 
adapt at ion of Control Lave 2, 3 and 4, since the corresponding nonadaptlve Lyapunov function derivatives 
for these control levs are of exactly the same form aa V® in (3.25) (see [1], Theorem 4-1 for details). 

For convenience, all asymptotically stable adaptive control. lavs discussed thus far, and their appropriate 
parameter adaptation lavs are summarized in Table II, corresponding to cases l.s, 2.s, 3. a, and 4. a, 
respectively. 

An alternative to choosing c sufficiently small in the above asymptotic stability argument is to 
choose K v sufficiently lsrge. In this esse, the condition on c above can be removed completely by adapting 
ILy on-line. This modification insures global asymptotic stability of the adaptive control system (l.e., choice 
of c Independent of the initial condition V Q ) and is discussed in more detail below. 
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3.2.2 Global Asymptotic Stability-Adapting Ky 

Sinca the velocity gain Ky enters linearly in the control law, it can be adapted as if it were an unknown 
parameter using the same formulation of Sec. 3.2.1. It will be shown that this approach removes the dependence 
of the choice of c on the initial condition V 0 and this completes the proof of global asymptotic stability 
for the adaptive case. 

Consider Control Lav 1 written in adaptive form, where both 0 and are adapted in real time i.e. f 

u - - y qi - v , 2 + V (3.20) 

litre, |L it a time-varying quantity which remains to be specified, and is as defined earlier in (3.3). 

The nonadeptive control lew u° in (3.1) is subtracted from (3.20) to give the following expression, 

u - u° - « T 4 <i 2 + Uji <3- 21) 

whar. AKy • ty ~ Ky .ad ♦ “ 

The Lyapunov function for the stability analysis is given as 

* - v° ♦ 7 * r r* * y « t*{a k v t « w ) . «>o, r-r T >o 0 . 22 ) 

where a new term has been added relative to (3.6a), quadratic in the error Taking the derivative 

of V along system trajectories end substituting control law (3.21) gives upon rearranging 

v - V° ♦ (Aq 2 ♦ cAq 2 ) T Hj# + pT* 

+ TR{J«Af v T - Aq 2 (A <) 2 + cAq^jA*,) (3.23) 


The latter terms are cancelled exactly by the choice of parameter adaptation lews, 

♦ - « - - r" 1 a x T (4q 2 + cAq : ) 

f ( 4 q 2 * c& tl^<*2 T 


(3.24.) 


(3.24b) 


The choice leaves V exactly of the form (3.9) i.e., applying the 0-Ball Leona 2.1, 
V-V°< - UlAq.ll 2 - X-l |Aq_; I 2 


* I n 2 ° 2 j < 

0 1 

In (3.26) and (3.27), all quantities are defined exactly as in (3.12a) and (3.12b) respectively, except 
for V Q which is presently the initial value of V in (3.22). Furthermore, the values of and are 
once again given as 


*1 * I 1 °.ln (K p + C V * * 2 c 2 < m » 

e, - i u - £_ )« (m) 


An important observation is that. 


V o - <Xl|!C v in , IUC V 


CXllKjl) . UK 


Hence, for any choice of 6>0, c>0 and K_ ) - Kp^ > 0, there exist values of 0 , i^, and Ky*K v '^ > 0 (with °nin^v^ 
sufficiently large) such that inequalities (3.26) and (3.27) are satisfied, and Cj^O, $ 2 *® * Q 0*28) end 
(3.29), respectively. Global asymptotic stability of this adaptive control scheme then follows immediately 
by application of Barbalat's Learns to the Lyapunov function derivative (3.25), as was done earlier in 
equations (3.17) through (3.19). 
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Tba global asymptotic stability of adaptive controllers bawd on Control Lava 2, 3 and 4 (whara Ky la 
adaptad on-line) follow from an idantical argument, slnca 4° corresponding to tba nonadaptlva Lyapunov 
function darivatlvaa for thaaa control lava ara of exactly tba aa me fora aa $° In this ana ly a la (saa (1], 
Theorem 4*1 for details}. for convenience, thaaa adaptlva control lava involving adaptation of K* ara 
summarised in tabla II, corraapondlng to caaaa l.b, 2.b, 3.b, and 4.b, raapactlvaly* 


3.3 Global Asymptotic Stability for Adaptation of Control Lava 3, 6 and 7 

Global asymptotic atabillty for adaptation of Control Lava 3, 6 and 7 can ba proved ualng exactly tba 
ttae technlquea aa applied in Sac. 3.2. The only dlffaranca Ilea in alight varlatlone in the nonadaptlva 
Lyapunov function derivative V° which ariaaa In each adaptlva control analyela 

Due to apace limitations, thaaa proof a have bean oalttad, but the raaulta ara summarized In Tabla II 
corraapondlng to caaaa 3. a, 5.b, 6. a, 6.b, and 7. a, 7.b, raapactlvaly. Dataila can ba found in [21] , to 
which the aquation numbers In Tabla II ara referenced. 


4. Adaptlva Computed Torque Method 

It vaa Motioned earlier that In the computed torque Mthod (l.a. v control law 0) the praaanca of the M(q^) 
tarn premultiplying the Kp and Ky galna complicates the Lyapunov analyala and hlndara moat alapla attempts to 
Mka It adaptive. Nevertheless, the computed torque controller la a well-known control lav In the literature 
and la widely applied In practice. Hence, It la uaeful to Invcatigate under what condltiona It can be made 
adaptive, and to what extent adaptive atabillty can be guaranteed. For thle purpoae, we conelder a apeclal 
caae of the computed torque control lav which has acalar galna kp and ky, l.e., 

u° - - »(q 1 )(k p Aq 1 + k/q 2 ) + k(q x ) + Jtfq^q^ + c <V q 2 )q 2 


This la written In adaptive form aa, 
u * u° + Hg4 

where ♦ • 8 - 0 # and the linear in the parametera part has been chosen aa, 

H g 0 - - M(q 1 )(k p Ac ll *fk v Aq 2 ) + kCq^ + MCq^q^ + 


(4.2) 


(4.3) 


Let a Lyapunov function be defined as, 

v - v° + | ♦ T r$ , r - r T > o (4.4.) 

where 

v - \ 4q 2 T M( qi )aq 2 + | (k+ck^Aq^ M( qi )aq x + cAqJ M(q x )Aq 2 (4.4b) 


Then, the derivative of (4.4) along system trajectories Induced by control (4.2) Is given by 


V - Aq 2 T [-k v M(q 1 )Aq 2 + j ^(qj.Aq^qj) 


+ cAq x T l-k p M(q 1 )Aq 1 + M D (q 1> Aq 2 )q 2 + M^q^Aq^qjl 

(4.5) 

♦ (Aq 2 + cAq x ) T H g * + » T I> 

(4.5) 

Let, 


♦ - 9 - - r" 1 H g T (Aq 2 +cAq 1 ) 

(4.6) 

Then, 


V < - o 1 ||Aq 1 || 2 - “ 2 1 1 Aq 2 1 I 2 + Y 12 ||Aq 2 || | f Aq^ | | 2 


+ ^ 21 l|4q i l| + Y 22 l|Aq 2 ||)||Aq 2 || 2 

(4.7) 

where 


o - c(k o(M) - n,(l + o 2 )) 

1 p - 2 

(4.8) 

“2 ' V (M) " I n i | 
0 

(4.9) 

Y 12 * Cn l 

(4.10) 

v 2l ' cn l : t 22 ' I n l 

(4.11) 
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Applying the a -Ball Lemma, It follows that, 


V i- A 1 l|Aq 1 l| 2 - X 2 |U, 2 || 2 

(4.12) 

V ± 

ck o(M) » n,(l + p 2 )) + c n.Cr 2 ) 2 

p - 2 2 

(4.13) 


(4.14) 

c i “ i tk P * C K - cl2 )2< M >» 

(4.13) 

C 2 - | U - Sj)o(M) ; l 2 » 0 arbitrary 

(4.16) 


It Is noted that for any c > 0, both kp and ky can always ba chosen sufficiently large so that £i>0 and 
C 2 >0 (for appropriate choice of t 2 >0 in (4.15), (4.16)), and inequalities (4.13) and (4.14) are satisfied. 

Hence, the adaptive computed torque control law given by (4.2), (4.3) with parameter adaptation (4.6) is 
asymptotically stable when kp and ky are chosen sufficiently large. 

Since kp and k v must be chosen sufficiently large with respect to the Initial condition V 0 (c.f., (4.13), 
(4.14)) this proof of asymptotic stability Is not global (i.e., for fixed k p and ky there will always exist some 
V 0 such that *i and/or *2 ar « not positive). For this particular algorithm, it is presently not clear how to 
adapt k p and ky to Insure global asymptotic stability since the control u in (4.1) is not linear in the 
parameters (6, k p , ky) . 


5* Summary and Remarks 

The adaptive control lavs derived herein, along with the sufficient conditions for stability and appropriate 
parameter adaptation lavs are summarized In Table II. Several remarks are In order at this point in the 
discussion. 

Remark 5-1 All adaptive control laws in this paper were derived for the general tracking control lav. However, 
significant simplification occurs in many of these designs for the special case of set-point control (i.e., 
^2d“^2d“°) * 

Remark 5-2 The adaptive robustness issue remains open. Certainly for parameter adaptation laws of the form 
given in Table II , there will be sensitivities to noise disturbances and unmodelled dynamics directly analogous 
to chose which arise in the linear adaptive control case. It presently appears that many of the robustness 
techniques developed in the linear adaptive control literature will carry over to the nonlinear adaptive control 
application. This conjecture, however, remains to be investigated. 

Remark 5-3 In the nonadaptive case, many of the control laws in Table II are In a form appropriate for 
application of the recursive Nevton-Euler computational algorithm. However, the Newton-Euler algorithm requires 
knowledge of all physical parameters-more parameters than^are actually needed to control the system adaptively 
and more than are actually adapted on-line In the vector 0 of Table II. Hence, the transformation from 0 back 
to physical parameters is required in order to salvage use of the Hevton-Euler algorithm in the adaptive case. 
However, the transformation is generally nonlinear and will not lead to a unique solution unless further 
constraints are imposed. One typical set of constraints arises when only the payload mass is unknown. In 
the more general adaptive case, it is useful Co note Chat all linear in the parameters expressions can be 
implemented directly, since representations of the form H8 are assumed to be available in symbolic form. 

Remark 5-4 The control laws of Table I were derived In [1] for the general desired potential energy function. 

This feature was dropped in the adaptive case in order to simplify the analysis. However, it appears that 
the adaptive control laws developed herein can be extended to the more general esse and this line of research 
presently under Investigation. 

Remark 5-5 A brief comparison with the recent results Psden (10] and Slotlne and Li (11] is useful. In (10] (11], 
adaptive control laws are derived by choosing u to cancel various terms in the Lyapunov function derivative, 
rather than overbounding them (via Lemma 2.1) as was done here. This approach has the advantage of providing 
global asymptotic convergence without adapting gains and Kp. The control laws, however, are by necessity more 
complex than those designs considered here, and do not simplify in the set-point control case. 

6. Conclusions 

A new class of asymptotically stable adaptive control laws is defined by adapting the control laws of (1] 
in a certainty equivalence fashion. These algorithms are proved to be asymptotically stable without approximations, 
linearizations or ad-hoc assumptions concerning the nonlinear manipulator dynamics. Furthermore, the asymptotic 
convergence properties can be made global by appropriate adaptation of feedback gains. On-going research 
efforts are directed at adaptive robustness, computation, and obstacle avoidance problems. 
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TABLE II SUMMARY OF ASYMPTOTICALLY STABLE ADAPTIVE CONTROL LAWS 




Moo. required Global A. W to etc Stability 


R 



Kacjrsiv* Mavton Eultr applicable In nonadapti*a cm 
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1. Abstract 


The pa p er * 


/ 


St. Lucia, Australia 4067 

J 


a nonUnear extension of model reference adaptive 


control (MRAC) techrtTque/to guide a double arm nonl inearizable robot 
manipulator with flexible links, driven by actuators collocated with 
joints subject to uncertain payload and inertia. The objective is to 
track a given simple linear and rigid but compatible dynamical model 
in real, possibly stipulated time and within stipulated degree of 
accuracy of convergence while avoiding collision of the arras. The 
objective is attained by a specified signal adaptive feedback controller 
and by adaptive laws, beth given in closed form. A case of 4 DOF 
manipulator illustrates the technique. 


2. Introduction 

The MRAC technique becomes popular proposition for guidance of recent robot manipulators, with demand for 
precision pointing in difficult conditions, under the action of full scale dynamic forces, and subject to 
uncertainty in parameters. Such manipulators, particularly these used on spacecraft are highly nonlinear and 
nonl incarirable structures (geometric nonlinearity of clastic links, large angle articulation, nonlinear coupling 
of DOF * s , nonignorable gyro and Coriolis forces, several equilibria), while classical MRAC is linear and 
applicable to rigid bodies only. Thus the extension is needed for handling nonl inearity, see [1], and flexible 
links, see (2). On the other hand many robotic objectives, atfain particularly these in difficult space conditions 
require at least two arm systems. Thus the tracking has to' be a double MRAC (mutual reference adaptive control) 
which secures tracking the same model by two arms while ^voiding mutual collision - cf. (3J, [4]. If adaptive 
(self-organizing) control is intended, the tracking relates not to a given path but to a given dynamic target- 
model with prescribed target-parameters. We take the model simple thus rigid and linear, but locally compatible 
with the nonlinear arms regarding equilibria. Each arm is represented as an open chain with n DOF, nonlinear 
characteristics and coupling, elastic links, driven by actuators collocated with joints, under uncertain inertia 
parameters and uncertain payload. The tracking is done in real possibly stipulated time by a designed signal 
adaptive feedback controller and integrablc adaptive laws in che state space, while avoiding collision between 
arms of all the joints (and elastic nodes) in Cartesian configuration space. 


3. Motion Equations 

Lagrange motion equations give the rigid dynamics of the arms in the general format 

j a 1.2 


A J [q J ,s J )q J 


r J (q J ,q J ,\ J ) 


n J (q J .A'* ,s J ) 


B J Cq J ,qV 


( 1 ) 


where 


q J (0 


q o 

j-th arm varying in the known bounded work region 


is the configuration vector of the joint variables q-J 


□ J of the 


of the configuration space R ; q(t) is the correspond- 


ing vector of joint velocities in the specified bounded subset 
u J (t) k U *■: R n are the control vectors in given compact set of constraints 
are the vectors of adjustable system parameters in bounded bands of values A 


iV of the space tangent to 
U ; \ j (t) e A 


and 


s J (t) c S c R k 


i. S 2n 
is an 


uncertainty parameter within the known band S . Moreover A**(q^,s^) arc the inertia n x n matrices obtained in 

the known way from the quadratic form of kinetic energy. The vectors 11^ = (Tl^ , . . . represent potential 

: : : T in 

. ,(*') represent the internal nonpotential acting forces (Coriolis, 


forces (gravity, spring) while =* (T 

gyro, centrifugal, damping structural or viscous, etc.) and B J is the actuator transmission (gear) nonsingular 
n*n matrix. The control vectors u J (t) arc selected for the objectives of tracking and avoidance by adaptive 

feedback control programs (t) 


i I 7 .1 .7 i 2 

p (o (t),q fc (t),q (t),q (t),\ (t),\ (t)) on corresponding products of 


A^ * A^ * A . For convenience the superscripts "j" will be dropped until they are needed to avoid ambiguity. 

Considering the links elastic we introduce the deformation coordinates for the i-th link as shown in Fig. I, 
while using the Ritz-Kantorovitch scries ;xpansion 


z , 
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r i (y i ,tJ “ ^ r i^V r i* t) * r i ty i Jr i Ct} 

and for , w^y^t) analogously, with the exact solution expected for m 


• • . We take ■ large 


S I * length 

Figure 1. Flexible link 

enough so that the Kantorovitch linearization is physically justified. The technical way about it is to 
stepwise subdividing the links between grid as long as the difference of results for successive m's becomes small. 

Having specified (2) we form the vector nCt) • (n^t) , . . . ,n n (t) ) T , where r* i (t) » (r^t) ,v i (t) ,w. (t) ) T and 

following (5) write the hybrid system as 

A Alfql (0 D.lfql fOPjfq) fr(q,q) fri(q,\,s)) fBCq.qj' 

T L ♦ | C . * I W ♦ . ♦ - U (3) 

A‘Aj^ij (o d |nj |o p j -nj ir n (n,n)J l^Cn.s) J o ; 

where A^n,*) , r^n.n) * CH # s) are the elastic correspondents of A, F, II while A c (q,n) , 

D c (M#q»n,n) , P c Cq.n) and the internal damping D(q,q,n,n) as well as the hybrid restoring coefficients 

P(q # n) are matrices coupling the elastic and joint coordinates. These matrices are formed by integrals over 
the shape functions, see [5], Letting 

A A 1 

A(q,n,s) * - 

i*c V 

to be the hybrid inertia matrix which is nonsingular positive definite, we inertially decouple (3): 

(q/i) r ♦ Vi q,q,n,n,\,S) ♦ P(q f n,\,s) * 6(q,q,s)u (4) 

A . I . . T a - 1 T 

where V * A (D^r^r.Dn+r^) and P * A (P^r;^II are successively vectors of nonpotential and potential 
forces and the meaning of the matrix 8 is obvious. The vectors q, q, n, n form the state vector 
xct) * (x 1 (t),...,x N (tj) T » CqttJ ,nCtJ,qtt),n(.tJ) T ' A q * A f) * * A- • A ■- R N , N » 4n , for each am. For 

convenience (4) may be then written in the general state form 

x » f(x,u,A,s) 15) 

with f » (f l# ... ,f N ) of the shape specified by (4) in an obvious way. Formally (5) may be written in the 
contingent form: 

X < { f (x ,u , V,s) I s < S'- s 3) 1 

which for suitable f(*)» p(*)» U*) has solutions x(t) * k(x°,t) , t J 0 , absolutely continuous curves 

through each x° * x(0) in A . We shall consider the class of such solutions K(x°) by exhausting all values 
of s(t) in (5) at each t . 

4. The Reference Model 


We let the given Cartesian "world" coordinates representation of the reference model in general terms 

* F 

ra m 

3 7 n 

with 2n DOF, £(t) t R , and J suitable matrix, be off-line recalculated to the joint coordinate 

format of the rigid linear system 

ii ♦ D O ) u ♦ P ( a ) q si) 
m m m /M m m v nr^ra 


with the 2n-vectors q , q of joint coordinates and velocities, state x (t) - (qit),q (t)) * \ 

mi ra m m 
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£ 

V , P suitable matrices, while A • (A , . . . ,A ) • const c A c ft , * ■ n . Moreover 

mm m mx mx. 


P.(0(q*,n # ) - o 


( 8 ) 


with (q-V) denoting the equilibria of (3) on the surface q • 0. n • 0 . The total energy of the model 

will be denoted by E (£ ,£ ) in the world coordinates and E (a ,4 ) in the joint coordinates, obviously equal 

a m m m ■ a 

to one another. Then 


+ ( p » ( v do 


( 9 ) 




and substituting (7) f 


E .<vV • -ww 


( 10 ) 


The model is selected such as to allow achieving of a stipulated target behavior in the state space. To focus 
attention on something specific and yet general enough, let it be stability of the origin, guaranteed by the 
nonaccuramulation of the total energy i.e, non-negative damping 


while 


VvV s 0 • v s, ' 0 
> 0 


an 


( 12 ) 


in-the-largc i.e. on same CA L * A - A^ , where A L is the set in R enclosing all the equilibria. 
5. Objectives 


Now we consider both arms j - 1,2 and the model together. The block scheme of the system is shown in 
Fig. 2. 



Figure 2. Block scheme of the system 

Define two product 2N-vectors X^(t) = (x J (t) ,x m ( t) )*** * A * A * A" and two i-vectors <v*(t) 3 A^(t) - , 

which vary in A 2 *A generating the product trajectories (X^(X^°,t) f r* (cr*°, t) ) , t 2 0, X^° = X^CO) , 
a J0 = ot J (0) . Then we define the ’’diagonal” sets 


M J * {(X-\r*) < A 2 *A | x** * x m , * 0) , j 3 1,2 , 

and given stipulated u** > 0 f their neighbourhoods 

M ; = uxvrj *■ A-*A | ’• ^ * 1' * J i < ^ ' • > a 2 * 

Moreover we let A be a desired subset of A where we want the tracking to occur, and let t be the 
o L 


2U7 










stipulated time after which the tracking is attained with accuracy 


V 


J 


first Objective: The manipulator arms (1) are mutually p-tracking the target (7) on A q if there is a pair of 

controllers p^(*) # j - 1,2 such that for each solution k^(x^°,t) , t 2 0 of (4) in K(x^°) , the set 
A^ x A is positively invariant: (X^°,a^°) < A^ a A "*(X^ (t) ,a^ (t) ) c A^ * A and given t c> for each 
k**(*) c k(xJ°) the product trajectories satisfy 

(X J (t),a j (t)) < Mj . Vt 2 t c . (13) 



Figure 3. Convergence of product trajectories 
Suppose the transformation from joint to world coordinates (forward kinematics) is given by 

C;* * r 4(q J .n J ) . c - 1 . ,3*2n (14) 

anil denote Z(t) 4 (x*(t) ,x*(t)) . Then we let the set 

A » (Z .. A J | KJ-^| i d, VO.o • 1 3-2n) 

be the collision set between arms to be avoided. We define C A * - A , specified by > d , and let 

A a - (Z .. A 1 I d < < L) 

be the "slow down" safety zone, with e > 0 suitable constant. 

Second Objective: The tracking arms (1) avoid collision iff there is A^ such that for i:ny Z° CA „ and 

any pair k^C*) e K-^x^ 0 ) the corresponding product trajectory 

Z(Z°,t) » CA , vt » 0. (15) 


6. Sufficient Conditions 

We return now to the first objective and specify by N(j(A 2 *A)] a neighborhood of the boundary HA**A) 
of the region A**A . Then let = [.)(A**Aj 11 A q <a 1 , CM^ » (A 2 *A) - and introdi-ce open D J CM J 
such that n * 6 . Further we consider four C l -functions C ' j - N * R , V^( ■) : • R , j * 1 ,2 

with the positive constants 

v{ * vJ(X J ,a j ) , (X-'.a-’) - ) 

s 3 I 

inf V^(X J . ) | (X J ,i J ) . W n CM^ \ i 1 

v^« sup V J ^(X\J) | (X-\.i J ) . t(A**A) i CM^ j 

The first relation obviously requires forming Vg(-) from suitable >(A o *A) taken as its level, or conversely, 
forming }A , iA from levels of suitable V s C * ) ^ • in the latter case a° A , A smaller than these desired will 
be the secure choice. 5 ° 

THEOREM 1: Objective 1 is attained if, given A , A, u there arc prog-aras ( * ) and functions 

v q(‘) » V J f) such that for all ^ A 2 * A , 

a U 


2US 




(1) vJ(X j .a J ) S , V(X J .a J ) < « t , j • 1,2 


(ii) for each u* « p^X’.X*) ; 


v£(X J (t),a j (t)) < 0 , Vs J < S 

along the product trajectories (X^(X^°,t)a-*(a^°,t)) , t 2 0 , j ■ 1,2 ; 

(ill) 0 < V^(X J ,a j ) S vJ* , V(X J ,a J ) « Oi^ , j - 1,2 ; 

(iv) V^(X J ,a j ) S vj\ V(X J ,a J ) < P j " , j . 1,2 ; 

(v) for each ■ p^CX^.X*) there is a constant c^ > 0 such that 

*J(X j Ct),a j (t)) S -Cj , Vs J « S 

along the product trajectories (X^(X^°,t) ,<2(a^°,t)) , t 2 0 , j ■ 1,2 . 
Remark 1: The Objective l holds after a stipulated ® if Theorem 1 is satisfied with selected by 

J 4 

A % 

Cj • — • J ■ l ,2 . 


Theorem 2: Objective 2 is attained if Theorem 1 holds and given d there is a C -function ^ ■* R 


such that for the tracking pair p J («) , for all Z ' CA 
(vi) V a ( 2) > V A (t) , Vz < JA ; 
(vii) for each u** « p^(Z) , 


V A (2(2 w f t)) 2 0, Z < A a , Vs J ' 
along product trajectories 2(2°, t) , t £ 0 . 

^ . / V «*0 1 . A a ^ n 


lX J ,-i J ) <• CM^ , 


PROOF. Suppose sonc 1(2°, tj , t - 0 , Z° « A, crosses 3A at t. » 0 . Then by (vi) , V,(Z(t.}) < V.( 2'*) 

which contradicts (vii). / * 

7. Controllers and Adaptive Laws 

Let us set up 

vi * K (X J ) ♦ E (x ) ♦ aV ; 

o id ni n 

vJ Ajiy ,j j ■ W* aj * J - °i • 

u la J i J , lX J ,,i J ) . M J y ; 

V A * iit.lVj-E/x*) i 

where a- 1 • (.sign i{,...,sign . j * 1,2 . and E (x -1 ) is E (■) with x exchanged for x^ . Choosing 

in a m m 

N in CAj , the character of ^ m (’) specified additionally by (12), satisfies (i) f (iii) and (iv). 

j j • i 

To see that (vi) holds, observe that B (x J ) » t (\ ) of (6) and that increasing the distance 

mm 

iC v -^, x 0 for at least one J from its jA value increases the value of V A . 

To check upon conditions (ii), (v) , (vii) we differentiate (21) - (23) with respect to time 


t. ,tx j ) ♦ 6 fx ) 
m mm 

♦ 

• 

(24) 

i U J ) - E lx J 

m am 

• a^.r* 

, (X J ,.i J ) < cV , 

u 


lEUJ ‘ E.(* J ) 

ram ■ 

♦ aj; j 

. ix j , 1 J ) • cV , 

(25) 

a-*i J , (X J ,v*) . 

M j ; 

y 



IE n l x< )-E.( xl Jl' 


-v xJ >' • 

(26) 


h n (x j ) * VE jn (X J )*f J (T J ,u J , * (Su-P-P^q^) (q,;) . 
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The brackets of the functions 8, V , P dropped for clarity. Moreover CTMjJ are subsets of Qtjj defined by 

cV: E b U J ) 2 c a ixj 

CV: E b (* J ) < E b (x b ) . 

With a suitable choice of initial states the following set of conditions iaples (ii) f (v) and (vii): 

(a) min aax E (x**) i E (x ) , VCX^.a 1 ) , cV , 

j J 

u J s J 

■ax «in E B (x j ) £ E^xJ . V(X J ,a j ) < cV ; 
s^ 

(b) aax ain E (x*) > ain aax E_(x*) , VZ t C*A , 

u< s’ " u» s 1 " 

ain aax E (x*) < aax ain E fx J ) , VZ ' C*A , 

*« ■ u i *> ■ 


for q*0 f nj*0,j«l,2. In the above C*A arc subsets of CA defined by: 

C*A: E^lx*) 2 E^x 1 ) . 

C'A: E b (x*) - E^ix 1 ) . 

(c) ■ E (x ) - c , V M , ) • li.’ . 

m a j 

Observe that for ft* ■ 0 there is no need for adaptation and that the system (4) crosses the surface q • 0, 

*1 - o time instantenously (vertically) so there is no need for control in view of the saoothness of trajectories. 
Conditions (a), (b) are called control conditions helping to design p J (*) , condition (c) is called adaptive, 
helping to design adaptive laws. Let us check that (a), (b) , (c) indeed imply (ii), (v) , (vii). Consider first 

the case E (x J ) ^ E (x ) . Substituting (c) into (23) in view of (ii) we obtain » negative tens * E (x J ) . 
Boundedness of the work space necessitates the power; E^iV) i0 thus U*)- Substituting (a), (c) , and 

(tl) into (24 ) with (1$. we satisfy (v) in stipulated tiae t c . Note that this holds for any initial states. 

The case E m (x J ) ■ E^Cx^) is trivial as then • JE^x^j ' 0 , * fMx^) * c j - * c j • Finally we check 

(viij. Again first let E^tx*) t E^fx*) and obscrve that 0>) substituted to (2h) implies (vii). The case 

E (x*) * U [x 2 ) is obviously trivial. 

a a 

Observe that, with (10), (c) is implied by the following adaptive laws 

‘‘I * * slgn 'ifrmXi ’^i)' {:> 

for r! / 0 , i r i,...,n . Physically the solutions i** ( V°,t) represent the model energy flux which become 
positive or negative depending upon where is located (below or above the surface iJ * 0) thus regulating 

the increment of J to ;ero from anywhere outside the surface *• * 0 . 

8. Modular Oouble RP-Manipulator 

Our technique is illustrated below on the case study of the four DOE manipul ator with two arms shown in 

El*- *- 



I igurc 4. Hie modular 2'RP manipulator 
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The Lagrange equations of notion for each am result in the following notion equations 

( V***2 q 2 )q l ♦ 2.q 2 qj4 2 ♦ ♦ Kv^cos - .,«r ♦ Xfr ♦ 2 q* - u, (J# 

■ 2 q 2 * *2 q 2^ * k A * *2« ,in q l * u 2 ‘ 

Here \ y X 4 are damping coefficients, A 1# AA 2 spring coefficients, g-gravity acceleration, the remainder 

of notations shown in Fig. 4. The superscripts M J", J ■ 1,2 , are ignored for the tine being. take the 
possible payload on the grippers as unknown but within known bounds which nakes n^ specified by 

n S n 2 S a , 

where n, n positive constants. Allowing sin ■ qj - ^gqj , cos « 1 • IqJ , and subdividing th® equations 
(29) by corresponding inertia coefficients we obtain: 

q*i ♦ r A ♦ H i ■ A Uj , l ■ 1,2 (30 

where , . . . 

, 2 Wi q 2 * W* 1 ! 1 

*1 * ■i r * ♦ * 

P 2 " ‘ q 2 q l * l/ *2 X 4 q 2 ' 

- *l q l - *«V q ? * X 2 q ? • 1 «-2 q 2 q t * »"2 q 2 (31 

l *V J "» "2 q 2 


n 2 * * q l * 5* q l * 


8 l * V r " r ""2 q 2 ' 82 " “2 


The reference model is taken as 


The total energy v>f the model is 


\l * W.1 * '- l q -I * gq -2 * ° • 

q »2 * ^«4 q «2 * ***■! * ° ‘ 


WV * l(q »l‘ q i2 J * ^iSl * 2 *\l%2 


Hj f ferrnt 1 at 1 ng it with respect to time and substituting (32), 


Accordingly , 


E « l VV * * V »4 <2 


1 : (q.q) * (B.u.-i .jq, * (8,u,-: jq, . 


Choose (X J0 .,J°) . C*M J , ) > 1,2 and 2° . C*A . Then the control conditions (a), (b) hold if successively 

•in max| (8ju|-i j)q|| - - v BsJ l t 4 |)1 | ) 1 . J * 1 >*' 

“1 *2 1 (5 

min max( (B^-i J Jq J ) - - f j • 1.2 j 

tl J -J 

a, 
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■ax ■in|(BX-r;)q{J > *1" •“*( ( 8 ? u i' r ?)S?l 
“1*1 U l *S 

■ax min((8'uj-rj)qjl ' «in aax( (8Ju|-r»)q|] 
ul m\ “i "i 


Thus w« choose u* such that for q* f 0 , 


lain »ax[ (8ju1-f! )q|J i 

“1 "1 


ami for such u* , we choose uj satisfying 


■in Bax[ (8?u?*r?)q?l < »ax ainj (8Ju** r p^11 
uj ■{ 1 1 1 1 “1 ■$ 


The procedure for u’ and u* is identical utiliting the second inequalities of (34), (35). Assuming 
symmetry of anss: «| - m{ • ,■«•■>■ « 2 « !■,») , r‘ - r* • r , and substituting the expressions for 


! i * D i ’ B i * *•■) * *» 2 • wc obtain the tracking controllers 


. „” 3 ^y 1 .— (■ 1 r J ♦ «(q*) J ♦ i»qjq 2 * V»|q,|q, , Vqj + 0 


^ suitable constant, Vq^ ■ 0 , 

| - I»! rl * *mpM * * 'jlMjltM ,)' 1 . y q| * " 

l suitable constant, Vq| * 0 ; 


and the collision avoidance controllers 




; m4 mi 


u*(t) » j 


V mUmJ ) 1 ‘ . y M* * " 


suitable constant, Vi’l* * <> 

W4 i: - - - -V 1- ' VM 1 ' 0 * 


- TJ 

u^(t) * *j 

[ suitable constant, Vq| * 0 , 

which imply the control conditions (a), (b) for our example. The adaptive laws (24) are 

Vj 0 , U - 0 

*3 * - tsi * n ‘3 )V »3^1 * Ic ) 

‘J * - tslgn ' iC j 

for j * 1,2 . The first two laws vani:>h identically, since by design 3 ’ . 'l * 'm 2 * <N,un,erica * 

simulation of our -nodular case, with the data m y * 70kg, m - 30kg, m * 40kh. r * O.bom, \* { * 20, * 20, 

V « 5 t a 2 , is shown in fig. 5, and confirms the convergence-avoidance required. 
m3 * a4 


.v# 

**1/ /, 

/ A\ 


figure 3. Numerical simulation 
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1 Introduction 


Robots working in a perfectly structured world do not 
need sensing: a structured world in which the dimensions 
of all parts are within tolerance and in which careful plan* 
ning has taken place to ensure that such parts can be 
assembled, a world in which everything is precisely lo- 
cated and everything functions as planned, a world in 
which all necessary jigs(l| have been designed and pro- 
vided. Such a world is the production engineer's dream 
and will probably never exist. Even in today's most au- 
tomated and structured factories there are still operators 
present to “un-jam" the perfect machines when the struc- 
ture gets a little out of line. These machines, which we call 
robots, are of course only programmable uni venal transfer 
devices, machines which can be programmed to move tire- 
lessly from position to position as perfectly as the parts 
and machines they work with. Real robots don't belong 
in factories any more than people do; what is needed in 
factories is well designed automation tended by operators. 

Of course there is a limit to the number of well de- 
signed pieces of automation we can have. In the home, 
for instance, a sewing machine and a food-processor do 
their jobs much better than humans) l), but the modern 
kitchen is slowly beginning to fill up with such special 
purpose devices, which the displaced humans now spend 
their “leisure" time fixing. Humans are needed to provide 
the structure required by these devices. The dish-washer 
functions well ir. its own environment) l), but who puts the 
dishes in and takes them out? Further, the automation 
of many tasks such as dishwashing requires the substitu- 
tion of massive quantities of energy and natural resources 
(The Regular Cycle uses 20 gallons of water which must 
be heated to at least 180 # ) in place of intelligence (“That 
plate's o.k./he didn't use it, just brush off the crumbs,"). 


When any lack of structure occurs, however, we must 
rely on sensors. If something is misplaced, or if something 
will not fit, relying on a sensor-less, geometry controlled 
approach would be a disaster. Sensors have two roles, to 
monitor task execution and to establish the state of the 
world. Both these tasks require the use of a world model. 
Both tasks also require reasoning and planning. Estab- 
lishing ths state of the world requires a sensor strategy 
and the interpretation of sensor data in terms of the world 
model. Monitoring task execution also requires sensor 
strategies and interpretation of sensor readings in terms 
of the world model. Errors, when they occur, are detected 
by the interpretation of sensor data, once again in terms 
of ths world model. If this is to be done reliably a number 
of independent sensors is needed (sensors fail also). Once 
an error state is determined, appropriate recovery action 
must be planned. If a part is dropped on the floor, then 
it may be left, but if it is dropped inside a mechanism 
where it could prevent functioning, then it must either be 
retrieved or the mechanism replaced. Error recovery is 
not simple. 

Of all the sensors that a robot might have, force sens- 
ing is the most fundamental. Blind people /unction quite 
well in the world but people who have no kinesthetic feed- 
back are totally helpless . Consider the well structured 

world of manufacturing with a task fully under position 
control: the detection of any unexpected force is a clear 
indication that something has gone wrong. Force sensing 
can provide this vital information . In any situation where 
complete structure is absent, force sensing becomes pri- 
mary in the sequencing of a task. Consider teleoperation 
where tasks have some structure^). 


/This material is baaed on work supported by the National Science l 
Foundation under Grant No. DMC-841I879. Any opinions, findings, / 
and conclusions or recommendations expressed in this publication are / 
thpee of the authors and do not necessarily reflect the views of the 
National Science Foundation. 
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Th« general-purpose manipulator may be used 
for moving objects, moving levers and knobs, 
assembling parts, and manipulating wrenches. 

In all these operations the manipulator must 
come into physical contact with the object be- 
fore the desired force and movement can be 
made on it. A collision occurs when the ma- 
nipulator makes this contact. General-purpose 
manipulation consists essentially of a series of 
collisions with unwanted forces, the applica- 
tion of wanted forces, and the application of 
desired motions. The collision forces should 
be low, and any other unwanted forces should 
also be small. 

Goerts identifies three clear states: 

1. Motion in free space. 

3. The exertion of a force. 

This work of Goerts influenced the use of force se- 
quencing in the manipulator control system WAVE [3] and 
later that of Inoue[4). Two types of commands were in- 
cluded in a language WAVE describing a sequence of ma- 
nipulator motions. 

STOP terminate the current motion when a force equal 
to the argument is detected, also known as a "guard- 
ed move" (5 1. 

FORCE during the next motion, the force in a give ', 
direction, is to be controlled to the value given as 
an argument. If the force is specified to be zero then 
the manipulator Is “free" to move in the direction 
specified. 

A further command allowed for a force to be applied by 
the manipulator, of course, the manipulator would have 
to be free in the direction In the “Force Vector Assembler 
Concept [6]" commanded manipulator Cartesian veloc- 
ities cculd be modified by measured end-effector forces 
and moments 

v = v 0 - [MJ f (1) 

Off-diagonal elements of the matrix M allowed for mo- 
tion to be specified in directions orthogonal to an applied 
force. A curious side effect of this produced a switching 
phenomenon similar to that described above — a contin- 
uous control system with two states. The end effector 
would trace along an edge until a corner was reached and 
then proceed to trace along the next edge . Unfortunately 
it was not possible to continue in this fashion along the 
following edge. A similar “switching" phenomenon oc- 
curs in a special device for making chamfer-less insertions. 
The pin is brought into the hole at an angle , on contact a 
linkage rotates the pin to align it with the hole axis where- 
upon insertion occurs . These phenomena are, however, 
limited to only two states and do not generalize further. 
Recently this type of control has formed the basis of more 
complex insertion strategies [7] in the form of a “general- 
ized damper" in which the force expected is proportional 
to the velocity error along some direction. Both of these 
strategies are limited to two-state systems. A task to in- 
sert a key into a lock, turn the lock 180 degrees, and then 


withdraw the key, cannot be characterized by such a con- 
tinuous system. It is, however, simple to describe such a 
taak in terms of force/displacement transitions and con- 
trol mode switches such as used in WAVE [8], 

We may then characterize manipulator control into 
two basic states and tranaitions between them: 

• When a manipulator Is moving in free space it con- 
trols displacement and monitors force. The detec- 
tion of any unpredicted forces indicates a serious 
error state. 

• When a manipulator is constrained by the environ- 
ment it controls force and monitors displacement. 
The detection of any unpredicted displacement in- 
dicates a serious error. 

• On contact, or on breaking contact, the control and 
monitoring modes switch. As contact is made the 
reaction forces rise, indicating contact. When the 
desired contact force is obtained the control mode 
switches from displacement control to force control 
and the contact force is maintained at the required 
value. As contact is broken the reaction forces go to 
zero and the control mode switches to displacement 
control with the contact force maintained at zero. 

The detection of contact is a problem for a rigid ma- 
nipulator of finite inertia. When contact is detected the 
manipulator is brought to rest discontinuously — it is 
stopped . The kinetic energy is dissipated by various mech- 
anisms, some potentially destructive. Given the stiffness 
of the manipulator and of the environment there is a 
clearly defined maximum speed at which contact may be 
safely detected and controlled. 

2 Force and Position Controlled 
Degrees of Freedom 

When a manipulator is constrained by the environment, 
force is controlled. There are, however, six environment 
constraints, three of translation and three of rotation. For 
each of these six degreea-of- freedom either force control or 
position control may be specified (3|. 

A robot manipulator closing a door by grasping the 
handle firmly has only one degree of rotational freedom 
— the rotation of the door about the hinge axis. In this 
situation force control is required along ail three trans- 
lation axes and force control is required about the two 
rotation axes perpendicular to the hinge axis. Rotational 
position control is required about the hinge axis. Note 
that one doesn’t simply push on the door handle to close 
the door but one controls the angle of dosing as a function 
of time — how fast the door is closed — “Don’t slam the 
door.'" All the remaining axes are in a control mode 
with a desired force of zero along and about all other 
axes. Notice also in the above example thet the forces 
and displacement control modes may be simply described 
in some orthogonal coordinate frame. In the example 
given, the origin of the coordinate frame would be along 
the hinge axis with one of the axes aligned with the hinge 
axis. If the z axis were aligned with the hinge axis then 
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w* could specify the compliance needed u shown in Fig- 
ure 1. Notice the motion request ROTATE and the motion 
WITH 

FORCE X - 0. 

FORCE T - 0, 

FORCE 2-0, 

TORQUE X - 0. 

TORQUE T - 0 
ROTATE ABOUT Z 
UNTIL 

TORQUE Z - 100; 

Figure 1: A Program to Close a Door 

termination specification UNTIL TORQUE Z • 100. 

Manipulators are controlled by actuators located at 
their joints. To provide for the control of the six Cartesian 
environment variables, position and rotation, six joints 
are required. I» a degree of freedom of the manipulator 
is constrained then attempting to control all six joints 
will result in an over-constrained system; large internal 
forces can result. If one of the joints which contributes to 
motion in the constrained direction or axis is controlled 
in force in place of displacement, the overconstraint dis- 
appears and the system is controllable. This approach 
was first used by Inoue in turning a crank[9j and later 
formed the basis of the compliance used in WAVE[3|. If 
more than one degree-of- freedom of constraint exists then 
additional joints must be force controlled to provide for 
each constraint. In the door closing example given above, 
five joints of a six-degree-of-frccdom manipulator would 
be force controlled at zero force, and one joint, whose 
principal motion was in the door closing direction, would 
be displacement controlled. 

If the motion of the joint selected to provide a degree- 
of-freedom does not correspond completely with the con- 
strained direction then the the position of the manip- 
ulator will be modified in the unconstrained directions 
In turning the crank, the crank would be either slightly 
ahead or behind its correct position. If this matters it 
may be compensated for by modifying the commanded 
Cartesian position[lO,ll], The major problem with com- 
pliance provided in this manner is in selecting the appro- 
priate joints to provide the compliance. While it is always 
obvious which joints should be controlled in any given sit- 
uation, there is as yet no formal algorithm to select these 
joints automatically. Another drawback is that in cer- 
tain motions the joint to provide the compliance changes 
as the motion is made. Consider turning a crank: with 
the crank at the top of its motion, a joint which controls 
vertical motion would be appropriate to provide the nec- 
essary radial compliance, but as the crank is turned the 
radial direction requires a joint which controls horizontal 
motion. Switching between joints can be done[3] but it is 
difficult. 

This form of compliance is very simple to implement in 
manipulators whose actuators are powered by electric mo- 
tors as motor current is directly proportional to torque[3|. 
Joint friction and gearing, however, detract from this sim- 
ple form of control and various attempts have been made 
to close a torque control loop around the joint[l2]. These 


methods have met with only moderate success as the con- 
trol coupled two rigid systems of comparable frequency 
response[l3). 

In 1081 Raibert and Craig developed a control method 
called “Hybrid Position/ Force ControljU],” based on the 
theoretical formulation of the above compliance methods 
by Mason(l5|. In this method not only was the com- 
pliance specified in an appropriate Cartesian coordinate 
frame but the control separation between position and 
force was also performed in Cartesian coordinates. The 
observed joint position of the manipulator was converted 
into Cartesian coordinates and subtracted from the de- 
sired Cartesian coordinate position yielding Cartesian po- 
sition errors. Any position errors in a complying or force 
control direction were then set to xero and the remaining 
errors were transformed back into joint coordinates us- 
ing the Jacobian inverse. These errors were then fed to a 
PID controller to reduce errors in position controlled di- 
rections to zero. Note that no position feedback is applied 
in any complying direction. Similarly, force errors were 
compared to the desi.ed force to yield force errors in the 
Cartesian control frame. Any errors in a nor, -compliant 
or position controlled direction were then set to zero be- 
fore these force errors were transformed into joint torque 
errors by the Jacobian transpose. Note that no forces 
were specified in any position controlled direction. In th? 
system implemented by Raibert and Craig) 14| a force and 
torque sensor was mounted at the wrist of the manipula- 
tor to provide feedback for the force loop. Stabilization 
of the force loop was, however, marginal with resort to ad 
hoc control methods necessary. Once again we have two 
rigid systems of comparable frequency response, the ma- 
nipulator and the force sensor, such a system is very diffi- 
cult to stabilize |l3j. A similar system making use of the 
relationship between motor currents and joint torques has 
also been implemented) 11]. This system, with open-loop 
torque control, docs not suffer from the stability problems 
but does suffer from frictional and gearing disturbances. 

In 1983, Khatib, at Stanford University went one step 
farther and resolved the manipulator joint inertias into ef- 
fective Cartesian Inertias seen from the end-effector of the 
manipulator) 16|. Once Cartesian position errors were de- 
tected, using the hybrid position/force control scheme, a 
PID controller was implemented in Cartesian end-effector 
coordinates to produce corrective accelerations which were 
then transformed into corrective forces by the effective 
Cartesian inertias. The resulting forces were transformed 
into joint torques in order to control the manipulator, 
again using the Jacobian transpose relationship between 
forces and joint torques. Unfortunately, as the manipula- 
tor configuration changes, the rate of change of effective 
Cartesian joint inertia varies much more rapidly than does 
the corresponding joint inertias. This is a considerable 
computational burden. A second problem is that feed- 
back gains are applied in Cartesian coordinates while the 
manipulator is actuated in joint coordinates, and while 
it is possible to set constant high gains in joint coordi- 
nates it is not clear if similarly high gains are possible in 
Cartesian coordinates. No comparison between control 
methods for the same manipulator has been made. 
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3 Stability 

In the previous Section we described the hybrid control 
of position and force. This represent the two states de- 
scribed by Goerti(2|. In this Section we will consider the 
stability of these two modes and the problems of transi- 
tions between them. In the position control of manipula- 
tors high stiffness is desired so that the manipulator will 
be unaffected by disturbances. We would like the manip- 
ulator to move swiftly from position to position, stopping 
as quickly as possible with no overshoot. When the ma- 
nipulator is begin controlled at some position, we would 
like it to be unaffected by the application or any external 
forces of moments. It should act like a very stiff damped 
spring, very hard to deflect and dead-beat in its response 
to external disturbances. This is achieved by the applica- 
tion of feedback. Position feedback is required to provide 
stiffness, velocity feedback to provide damping, and inte- 
gral feedback to provide for the removal of any bias forces. 
Feedback gains are limited by the stiffness of the manip- 
ulator itself. The setting of gains and the design of a ma- 
nipulator for a given stiffness are a difficult engineering 
problems. The result is a system which has a well behaved 
basic response with a number of high frequency modes 
which decay slowly when excited. Such systems behave 
adequately in position mode but perform poorly in force 
control. The force sensor and environment are, unfortu- 
nately, both systems with natural frequency responses of 
the same order of magnitude as that of the manipulator. 
When these are coupled by contact of the manipulator 
with the environment then the resulting system is very 
difficult, if not impossible, to stabilize [17,13,18,19,12.14]. 
Whitney and Eppinger in their papers both indicate that 
stability may only be obtained when the sensor is stiff and 
the environment soft or when the sensor is soft and the 
environment stiff. Unfortunately, a soft sensor completely 
negates the stiffness required for position control. 

The remaining problem is the implementation of the 
transitions between position and force control. This oc- 
curs when the manipulator makes contact with the en- 
vironment. Contact between a rigid manipulator and a 
rigid environment is not well defined — the manipulator is 
moving at some velocity and then it is stopped. Where 
does the energy go? It is absorbed by the compliances 
in the system and, hopefully, dissipated. This can be 
destructive of many mechanical components such as, pre- 
cision gears, shafts, actuators, etc. To run any commer- 
cially available robot into a brick wall would result in con- 
siderable damage! The use of any form of force sensing 
aggravates this problem as the force sensor is typically the 
least stiff member of the system, the most fragile, and ab- 
sorbs all the energy. The design problem of Scheinman’s 
“Maltese Crass" wrist force sensor was not the sensor it- 
self but the force overload mechanism needed to protect 
it from damage. No form of force sensor based feedback 
changes this problem as the time constant of the inter- 
action is much shorter than that of the regulator. On 


contact, the force sensor mm m rapidly increasing force 
and the sensor output goes immediately offecale. The 
time scale of this Interaction is of the order of a few mi- 
croseconds. This signal is processed by a regulator which 
has a well defined minimum time response of the order of 
milliseconds. Contact is long since over before the regu- 
lator can respond and any damage to occur has already 
occurred. The eorUact problem is unsolved for rigid ma- 
nipulator, rigid sensor, rigid environment problems. 

4 Mechanical Compliance 

Based on a careful analysis of a peg-in-hole insertion [20] 
and the force-vector steering method [6j, a mechanical 
implementation of an insertion algorithm was developed 
at Draper Laboratories, the “remote center compliance 
— RCC" [21]. This device provided the necessary com- 
pliance to make peg insertions into low clearance holes 
from a vertical direction. The compliance was provided 
passively by springs. 1 

In the initial version of the remote center compliance 
no displacement sensing was provided, making the device 
very susceptible to damage if the displacement capacity 
of the device was exceeded. However, a later version, “the 
Instrumented Remote Center Compliance — IRCC" also 
provided displacement sensing which could be monitored 
to prevent damage. Both devices could be locked for po- 
sition control to provide the two necessary control modes. 
The device was low inertia with high bandwidth so that 
contact could be made at high speed by the manipulator 
with the small energy of contact (due to the low inertia of 
the RCC) absorbed by the passive compliance. The use of 
passive compliance solves the contact problem* although 
the device must be locked to provide for position control 
and the stiffness k is defined mechanically and may not 
be programmed. 

In order to overcome the locking problem Roberts[23 
investigated an instrumented single compliant link. The 
displacement of the link was used to stiffen the link for 
position control and to soften the link for force control. In 
the position control mode any displacement of the end of 
the terminal link caused the manipulator to move in the 
opposite direction so as to restore the initial position. In 
the force control mode any displacement of the terminal 
link would cause the manipulator to move so as to restore 
the initial displacement. Contact could be detected by the 
deflection of the terminal link and the resuiting motion, 
while the manipulator was brought to rest, absorbed by 
the compliant link as in the IRCC. It was shown that 
both modes were stable. We are currently working on a 
six-degree-of- freedom version of the device and hope to 
show stability and function. 


1 Hanaiusa and Atada made use of a spring loaded band to pro- 
vide compliance between the workpiece, the manipulator, and the 
environment but did not directly address the contact problem. 22 
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5 Qaiicuisions 

The hybrid control of force and position is basic to the 
science of robotics but is only poorly understood. Be- 
fore much progress can be made in robotics, this problem 
needs to be solved in a robust manner. However, the use 
of hybrid control implies the existence of a model of the 
environment, not an exact model (as the function of hy- 
brid control is to accommodate these errors), but a model 
appropriate for planning and reasoning. The monitored 
forces in position control are interpreted in terms of a 
model of the task as ar*» the monitored displacements in 
force control, "'he reaction forces of the task of “writing” 
are far different from thoee of “hammering." The pro- 
gramming of actions in such a modeled world becomes 
more complicated and systems of “task level” program- 
ming need to be developed. 

Sensor based robotics, of which force sensing is the 
most basic, implies an entirely new level of technology. 
Indeed, robot force sensors, no matter how compliant 
they may be, must be protected from accidental collisions. 
This implies other sensors to monitor task execution and 
again the use of a world model. This new level of technol- 
ogy is the “task level ” in which task actions are specified, 
not the actions of individual sensors and manipulators. 

0 Research Issues 

We may identify the following research issues in position 
and force control: 

• Matching individual joints to Cartesian degrees-of- 
freedom. 

• Control of the force of all the links of a manipulator 
not simply control of the force exerted at the end- 
effector. 

• The hybrid position/force control of redundant ma- 
nipulators. 

• Robust rigid manipulator, rigid environment force 
and contact control. 

• Contact transitions 

• Compliant end-effector control of robot manipula- 
tors to provide for both position and force control. 

• Compliant manipulator control to provide for both 
position and force control. 

% Task level systems to provide for the protection of 
sensors. 

• Motion modeling. 


3 Thi« wu graphically demonitrated by Dan Whitney at a con- 
ference in which he marched, arm rigidly outstretched, towards a an 
unknown wall. Without the compliance of a bent arm (to provide 
mechanical compliance) he would not have been able to react fast 
enough (regulator) to avoid hurting himself on contact 
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Abstract ^ 

Ihi PT * > y ni a aU ^iapli aithodi for the design of adaptive forca 
controllers for robot manipulators within tha hybrid control archi 
forca controller ia composed of an adaptive PID feedback controller, an 
auxiliary aignal and a force feedforward tera; and it achieves tracking of 
dealred force aetpointa ia the conatraiat directioaa. Tha position controller 
consists of adaptive feedback and feedforward controllers and aa auxiliary 
aignal; and it accoapliahea tracking of desired position trajectories in the 
free directions. The controllers are capable of coapensating for dynaaic cross- 
couplings that exist between the position and force control loops ia the hybrid 
control architecture. The adaptive controllers do not require knowledge of the 
coaplex dynaaic aodel or paraaeter values of the aanipulator or the cnvlronaent. 
The proposed control scheaee are coaputa t iona 1 ly fast and suitable for 
iapleaontation in on-line control with high saapling rates. 




1 . Introduct ion 

Although control of robot aaaipulators has been studied extensively in recent years, this study has beea 
focused priaarily on position control of aanipulators in free aotion within an unconstrained cnviroaacct. Ia 
asny practical applications, the aanipolator is constrained by the enviroaaent and certain degrees-of-freedr a 
are lost for aotion doe to environaentsl constraints. Vbea the aanipolator aakes contact with the snvironaent, 
the contact forces auat be controlled in the constraint directions, while the positions are controlled 
siaultaneously in the free directions. 

The problea of aanipulator control in a constrained environaeat has been investigated by several 
researchers 111. At present, three aajor conceptual approaches exist for slaultaneous position and force 
control. Paul and Shiaano (2) suggest a aetbod which uses certain Joints for position control while the 
reaaining joints are used for force control. Salisbury (31 puts forward a technique for controlling the end- 
effector stiffness characteristics ia the Cartesian space. Raibert and Craig (4} propose a conceptual 
architecture, based on the analysis of Mason (31, for "hybrid control" which allows forces to be controlled in 
the constraint directions by a forca controller, while a i anl t a neon a ly controlling positions In the free 
directions by a position controller. Raibert and Craig, however, do not prescribe a general and systematic 
■ethod for the design of position and force controllers. Nevertheless, hybrid control has gained considerable 
popularity over the other two alternatives for simultaneous position and force control [6-131. 

The present paper pots forth systematic methods for the design of adaptive force and position controllers 
within the hybrid control architecture. The force controller achieves tracking of desired force setpoints, 
while the position controller accomplishes tracking of desired position trajectories. The force and position 
controller gains are generated by adaptation laws by means of simple arithmetic operations, and thus the 
controllers are computationally fast and suitable for on-line implementation with high sampling rates. The 
adaptive controllers do not require knowledge of the complex dynamic aodel or parameter values of the 
aanipulator or the environment. 

The paper is structured as follows. In Section 2, the hybrid control architecture is outlined and the 
problem is stated. ‘Section 3 addresses the design of force control system using aodel reference adaptive 
control (MRAC) theory. The design of position control system is discussed briefly in Section 4. In Section 5. 
the force and position controllers are integrated in the hybrid control architecture. Finally, Section 6 
discusses the results of the paper and draws some conclusions. 
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2. Problem Statement 


Is tbit section, th« hybrid force/position* coatrol architecture i» discussed briefly* and tba fore# aad 
position traeklag ooatrol problems ara stated. 

Let na coaaidar a robot aaaipalator which parforaa a aaabar of diffaraat taaka is a Cartaaiaa apaea (X). 
Each task, ia general* iawolwea aotloa of the aaaipalator ead-ef factor in eartaia direetioaa aad* 
simul taneoualy, asartioa of forca by tha aud-effeotor on the eaviroaaaat is the raaaiaiag diraatioaa (5). The 
direetioaa of aotioa aad forca dapead oa the nature of tha particular taak to ba performed aad are reflected ia 
the "taak aatris" ia tha hybrid coatrol architecture ehown ia Figure 1. Tha taak aatrix alao eoataiaa the 
t raaaf oraa t iona required to nap the aeaeureaeate £ aad £* of* Joint aacodera and forca/torqaa aaaaora into 
poaitioa aad force wariablaa ia tha coaatraiat fraae defined with reapact to tha taak geometry (4). Not* that 
the desired force aad poaitioa trajactoriea ara also apacified ia tha coaatraiat fraae. For any given taak. 
the a-diaeasioaal Carteeiaa apace (X) eaa ba decoaposed into two orthogonal l- aad a-diaenaioaal subapaens (T) 
aad (Z). where a - l ♦ a. The "poaitioa aubapace" (T) eoataiaa the t direetioaa (i.e.* degreca-of-frcedo*) ia 
which tha aaaipalator end-effector ia free to aove aad along which ead-tffiflto r position ia to be control led. 
Tha "force aubapace" (Z) eoataiaa the reaaiaiag a direetioaa ia which the aaaipalator end-effectar ia 
constrained by aad interacts with the enviroaaeat aad along which the fpiigfil Xfi££i la to be controlled. 

Ia the hybrid force/pos i tioa coatrol problea addreaaed ia this paper* wa consider the "virtual" Cartnaiaa 
force £ acting oa the end-effector ss the aauipalated variable and the position or force of the ead-effeotor as 
the controlled variables [14]. The hybrid coatrol architecture ia based on two Independent aad noa-interact lag 
controllers as shows ia Figure 1; sanely, the position controller which operates ia (T) sad the force 
controller which acts ia (Z). The position controller generates the Cartesian end-effector force required 
to cause the ead-effector notion to track a desired poaitioa trajectory in f Y 1 . The force controller prodaces 
the Cartesian end-effector force £ s needed to ensure that the end-effector force follows a desired force 
setpoint in (Z). Since we cannot physically apply Cartesian forces to the end-effector* we instead conpnte and 
implement the equivalent joint torques needed to effectively cease these forces. The required joint torques 
are obtained fron the Cartesian forces by means of the Jacobian matrix J(£) of the manipulator* where £ ia the 
joint angle vector. 

We shall now address the problems of force aad position control separately la Sect lone 3 and 4 and then 
integrate the results in Section 5. . 


3. Design of Force Control System 

In this section* a simple dynamic model for force control ia the subspace (Z) is described, aad an 
adaptive force control scheme is developed. 

3*1 tonami c_Foige_Mo del 

The foil dynamic model of the ead-effector plus force/torque sensor ia contact with the environment is 
complex [IS]. However, the dynamic behavior of this system caa be modelled approximately by a aass-apring- 
riamper in each degree-of-freedon as shown in Figure 2 and described by the differential equation 

a af ( t ) ♦ d z(t) ♦ k x ( t ) - f(t) (1) 

Generalizing this simple model to the n-dimeaa ional force subspece (Z), the dynamic behavior of the system in 
(Z) can be expressed by the differential equation 

*oZ<t) ♦ D^U) ♦ t,2(t) - E,(t) (2) 

where £U) le 'he axl end-effector poe it ion/or ientat ion vector. N 0 is the symmetric pos i t ive-def la i t m mam 
generalised mass matrix. D Q it the mxm generalized damping matrix. I # is the diagonal mxa generalised stiffness 
matrix and £ s la the axl force vector applied to the ead-effector in the force subspace (Z]. The eleaomts of 
I, sre the "equivalent” translational (force) aad rotational (torque) coefficients of elasticity (stiffness) 
of the system ia various directions in (Z). By an appropriate choice of the { Z } subapace origin, thm axl 
force/torque vector P(t) exerted by the ead-effector oa the environment is related to J(t) by the 
generalization of Booke’s lav as 

?<t) - £<t) (3) 

From equations (2) and (3). we obtain 

A P(t) ♦ B P<t) e P(t) « £ I (t) (4) 

where A - and B - D Q X^ 1 are mxm matrices. Equation (4) gives a simple dynamic model of the system ia the 

force subspace (Z). Since the manipulator dynamics is highly aoal inear* the matrices A aad B in equation (4) 
are dependent on the ead-effector Cartesian position and velocity vectors X and j and also oa the system 
parameters such as the equivalent stiffness and the payload mass* which are represented by the parameter wmetor 
p. Furthermore, due to internal cross-coupling of the manipulator dynamics, a "disturbance" term Cp(I) asst be 
included in equation (4) to represent the dynamic coupling from the position loop into the force loop, whore X 


* In this paper, "position" implies position and orientation and "force" implies force aad torque. 
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it lb •a4-«f(«<ter position vector la (T), liti, a ton realistic aodel for foroo ooatrol it oMilici tt 

A(l.il) lit) ♦ •d.X.t)f(t) ♦ (CU ♦ £ p <X) - E,(t) (5) 

Equation (5) it • tot of bi|k); complex aoaliattr ni ooapled second-order differential eqaatioas. 

3.2 Porti Coatrol taka— 

Im ordtr to ooatrol tlo force/torqac f(t) oxortod by the ead-ef factor oa the environment, lot at employ a 
FID ooatrollor with adaptive gains (Kp(t), Ij(t), KqU)) tad aa aaxiliarj tigaal d(t> la the foroo control law 

i a (t> - r r (t> ♦ iy<t > Kt) ♦ EjU) / 0 ‘i(»)4t ♦ Epit) id) * i(o <«) 


wkoro Zgit) it tko mxl vector of doairod foroo trajectory mood at • foodforward tern, aad tko axl foroo 
traokiag-orror vector |(t) - l 9 tt) - lit) it tko deviation of tko aotaal (aoatarod) foroo froa tko dotirod 
valne. Siaoo ia praotioal applioatioat tko dotirod foroo trajectory it vary oftoa a ooaataat aotpolat f r (t) • 
f r » tko FID ooatrol law it portioalarly toiUklo for tkia sitaatica. Fsfthcrmorc, tko aaxiliarj tigaal 4(t) 
compensates for tko oroaa-oompliag tora tad tko tiao tad paraaotor variations of A aad 9 at Ir loot. No to 
tkat tko foodforward tora £ r (t) la iaoladod ia tko ooatrol law (d) aiaot idoallj v« wait J(t) * la 

sqaatioa M), tko gaiat of tko FID ooatrollor. aaaoly K z (t) aad I D (t). aad alto tko aaxilitry tigaal 

d(t) aro adaptod ia real-time to aoooapliak foroo aotpoiat traokiag ia apito of tko aoaliaoar aad pottikly tiao- 
▼try lag behavior of tko ayatoa aodol (9). 

Oa apply lag tko liaoar ooatrol law (d) to tko ayatoa aodol (5), wo oktaia 

* !<»> ♦ ■ id) ♦ fd> ♦ c p - l f d) ♦ ip*d) ♦ ^**(t)dt ♦ Epid) ♦ id) (7) 

Dtia. 8(0 - t,d) - Ed) aad aotia. tkat 8(t) • -fcd) aad 8<t> - *E(t) for a ooaataat doairad forca. aqaatioa 
(7) eaa ko writtoa at 

id) ♦ A*l<» ♦ Ep) 4(0 ♦ A" 1 (I ♦ Ep) Bd) ♦ A _1 «! B*(t) ■ A -1 t£ f - i(t)) (I) 


• kora B*(t) • Jj' £< t )d t ia tka aal lata.ral arror r.ctor. B^aatioa (I) eaa ba aapraaaad ia ataadard atata- 
apaoo form at 
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dt 
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where g # (t) 


it tko 3axl segmented orror vector. Eqestioa (9) coast itatot tko "tdjasUklo system" ia tko MAC fr 


Now, in tho ideal aitaatloa. tko doairod behavior of tko force orror E B (t) ia described by tko komogoaooaa 
differential eqeatioa 


E.U> • 0 3 £.(t) o D 2 E.(t) ♦ DjE^U) • 8 


( 10 ) 


where Dj. Dj aad Dj are cooataat warn matrices which are choaea each tkat equation (10) ia stable aad embodies 
the desired performaace of tho force coatrol system. By choosimt Dj. Dj sad Dj ss diagoaal matrices, the force 
errors will be deeoapled; for iastaace 


i^iit) • dj|i Bl d) • djjE.jd) • d lt E^,(t) • o 


(ii> 


where the coefficients djj, dj 4 and dj j are chosea sack that the tracking-error E|(t) ■ P ri lt) - has a 

desired behavior and djjdj^ > d 3i to easare stability. Hqaatioa ( 10 ) caa be wrlttea as 


where j.(t) 


0 t B 0 \ 
i«d> - 0 0 i. I «. 

\ -»i *»J / 

‘ ( &•'-') 


(t) - D ^(t) 


( 12 ) 


is the 3 ax 1 desired error vector. Eqaetioa (12) coastitatea the "reference model" ia 


the coatext of MAC theory. Since the initial valaes of the actaal aad desired forces are often the same, the 
initial arror g.(0) is eqaal to xero. aad hence froa equation (12). g.(t) - exp(Dt) • 1 .( 0 ) - Q for all t. 
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aayaptotioal ly, froa tifiriMi (141 wo fH«lr# 


0 I ■ ♦ < *i, 

r'kJ 


/ o o o\ /«i*«o o\ 

o o o I ■ ttj* I o »ii, o 

\r% A-ii, a'** d / \ • o tx l / 

. 4 /•**• 0 • \ 

♦ 0? a £-<«•*;> 0 >,i. « 

41 \ 0 0 ra 1 */ 


• " * " 4moIi» UiMveiltloi, Ca 1# % v 7l ) art tuliri. Uj. |g. rj) or pooitUo 

• ••lari* n4 L It as aaa eoaataat aatris to bo apooifiod lator. It oqaatioa (IS). 0 o aa4 Qj aft tyaaotrlt 
poaitivo-4o fialt# 3as3a aatrioot, Qj ud q( art ayaaotrio ftiltlvt toa i-4of iaito 3ai3a aatrioot, aa4 tko 
ayaaotrio poaitivo-4of iaito 3aa3a aatris 


/ “l *4 «3 \ 

- "4 «3 

\"3 "3 «#/ 


it tki aolatioa of tko Lyapviov oqaatioa for tko roftroaoo ao4ol (12), aaaoly 

M D ♦ D'M - -N (14) 

wkoro N it a ayaaotrio pot it ivo-dof iaito 3ai3a aatris. Ia dorioiag oqaatioa (13). tko aatrioot A. B. aa4 £. I* 
tko robot ao4ol (3) art aataaod to bo aakaooa aa4 "alowly t iao-varyiag" ooapar#4 witk tko a4aptatioa olgoritka. 
tiaoo tko to aatrioot eaaaot okaago a igaif ioaatly ia took taapliag iatorval wkiok it of tko or4or of t 
aillitoooad. Now. ia ordor to aako tko ooatrollor adaptatioa lava iadoaoadoat of tko aodol aatris A. wo okooto 


o 0 • J A* i Qj • *• ; oj • Ajt **!' 1 ; qJ ■ IA«J -1 

wkort (A., 6,} art poaitivo aad soro or poaitivo aealart. aad 


(: : 0 


ia a ayaaotrio pot itlvo-dof iaito 3as3a aatris. 


Sobititatiag froa aqaatioa (13) iato oqaatioa (13). aftor aiapl if ioatioa wo obtaia tko adaptatioa la 
i<t> - 6jfl( t ) ♦ 6 2 i<*> (1<) 


*I«t) - UjI.U) ♦ «2 

i p (t) • A x l a u) ♦ »2 4i ta(t> B’ 101 

*D<t> - Tlla(t) B* (t)Ll ♦ T2 B’<t)Ll 


S (t) * Mj |.(t) ♦ Uf |(t) ♦ H ( B(t> (JO) 

aad Mj, Mj, Mg art appropriato aabaatricoa of N. Thoa, tko roqairod amsiliory aigaal aad P1D coatrollor gaiat 


aro obtaiaod at 


i(t) - j(0) ♦ »J /^aiDdt ♦ *ja(t) 

Kj(t> - 1,(0) ♦ a(t) e**(t)dt ♦ «2&(t) £•*< 
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CM) 


1,(0 - 1,(0) ♦ ) t S'lit) I' (OOt ♦ SjgO) I'CO 
IpCO - 2,(0) * Ti / # *lCO i’COLOt ♦ TjfCt) 4* COL (24) 

The 4 mm iMtrtl Iw ii IMb |im by 

1,(0 - * t (0 ♦ 1(0 ♦ I|(0 / 0 *I(O4t ♦ K,(0|(0 ♦ 11,(04(0 (IS) 


It !• Mt*4 tilt tkt luiilirjr algaal J(t) mb be iiMrititf by i W l D cBBtrtllir ifivta by U« feret error f(t) 
liBM, frM HUtlfBI (J0H11), 4<t) MB bt BlfrMMi tt 

|<t) - 4(0) ♦ UjMtfllU) ♦ ♦ *2* 3 ]|(t) 

♦ l*i*s ♦ *a"s> / 0 * {/ 0 ‘»<t>4i}« 

la frutlMl iayliBMtitiBa of toy forto aoatrol Iiv, iifftMitlitioa of tba aoiey fort* Miiinatit f(t) 
it BtiBiinhU aai, toffBtvir, dlf ftreet la t loa of tht toatttat fortt aetpciet [, prodaeee aavaattd lapel aet. 
Tfcit ergeaaat tafgtttt that tla iirivitlvi |(t) la eqattiees (20), (24) tag (25) aast M reflated by 
atiag eqaatloa (I). Tbia yields tba llatar liiftlvi fortt aoatrol lav 

Eg(t) - 1,(0 * 1(0 * 2j(t) / 0 *l(O4t ♦ K,(t)|(t) - 2,(t)Z<t> (20 

• kick 1* *kc«« la Olga,* J. *k*r* 2,<t) - I D (OC, I* tk* a, a velocity ftc4b*sk gala ■Ctrl, *a4 tk* tea 
LyCO^U) repreeeata velocity daaplag. Cboeeiag L • <EJ*)*» tk* tgaplatloa lava aov btooat 


l(t> - 1(0) ♦ 


( ft(t)dt ♦ 

(27) 

«g(l) - K{(0> 

♦ *1 

^^g(t)**'(l)4l ♦ ogld )«•'(») 

(21) 

1,(1) - 1,(0) 

^ fl 

/pVoi'UMt ♦ ljg(t)B' (i) 

(29) 

2,(0 - 2,(0) 

* Tl 

/‘lltli'dldt - «]i(i)Z'(t) 

(30) 


whore 

g(t) - ♦ M 5 f(t) - «gi(t) (31) 

• •4 Nj - g < i # . It eaa l« sbova that by proper ttltttloa of aatria N ia th« Lyapaaov eqaatloa (14), the 
aabaatriota Mj # ■ ^ tad Mg la aqaatioi (20) taa bt aide tqaal to tha dttirtd vtlatt Wj, t p tad T D • I,!,* 1 
rttptt t ivtly, tad hoatt eqaatioa (31) btcoaaa 

t(t) - *!4*(t) ♦ Wp4(t) - (32) 

vhtra W V p aad I T art tha axa diagoaal vtightiag aatriett oh otta by tho dotigaor to rofloet tha rtlativt 
aigaifiotaot of tha iat#|ral orror R*. tha fortt trror R, aad tha velocity £. raapaa t Ive ly. Froa cqaatloaa 
(27)-(30), tht gaaaral axpraaaloa for a typical aoatrollar gala E(t) vhloh acta oa tha algaal y(t) to gaaarata 
tha tors K(t)y(t) ia tha aoatrol lav (2d) eaa ba vrittaa at 

C(t) - 1(0) ♦ m, f \ i(Oi'(Odt ♦ k}g(Ox' (t) (33) 


vhtra iti aad arc •••l**’ gelta. This eoapatatioa eaa ba parforaad by a aiapla "adaptatioa aedala" show la 
tha block diagraa of Flgvra 4 (For g(t) ( va aat y - 1). Tha adaptatioa aodala aeta oa tha tvo lapat aigaals 
g(t) aad y(t) to prodaaa tha oatpat algaal K(t)y(t). The force aoatrol lav (2d) eaa thca ba eoaatrvcted by 
parallel coaaaetioa of foar aach aodalaa. 

The force coatrol aehaaa developed ia this aactioa la axtraaaly aiapla, aiaea tha adaptatioa lava (27)- 
(30) gaaarata tba aoatrollar gaiaa by aeaaa of aiapla latagratioa aaiag, for iaataaea, tba trapaaoidal rvla ia 
eqaatioa (33) eaa be iaplaaeatad aa 
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NO - Ni-1> ♦ j*j • Ji (a<Ol‘(i) ♦ ♦ Bj(i<l)l*(l>l <J4> 


where tha integer I denote* the stapling laetaat aad T ( |« tha eaapllng period. At a rmlt, tha foraa aoatrol 
law (24) aaa ivaluti 4 warp rapidly tad eoaseqaeatly. tka foraa aoatrol aafcaaa aaa ba iaplaaeated for raal- 
tiaa aoatrol with high aaapliag rataa (typical ly I Us), Vigh aeapliag rata i< wary daairabla la foraa aoatral 
appliaatioaa aad yialda improved dyaaaic pc r f oraenoe. Tha adaptatiaa laws (27)-(30) do aat reqnir# tha 
aoaplax aoaiiaaar aodal of aaalpolator dyaaaic a (5) or aay fcaowladga of paraaatara of tha aaaipmlator or tha 
eaviroaaaat. This fta daa to tha faat that tha adaptive foraa aoatro liar ha a "laaralag capabilities* aad aaa 
rapidly adapt itaalf to groat ohaagaa ia tha Baaipwlator or tha aavirouaeat paraaatara. 


4. Daaiga of Bosltloa Coatrol Syttaa 

la thia section, a dyaaaie aodal for poaitioa aoatrol ia tha tubepauc *T) it datorlbad tad aa adaptive 
poaitioa aoatrol tahaaa la briefly explained. 

4.1 Piaaaig..ffljiiip> Jfeiil M 

Tha dyaaalat of tha ead-effccter ia tha Cartaaiaa spaee (X) aaa ba rapraaaatad by (141 

MVI ♦ d(|. X> ♦ *<X) ♦ l • E <») 

whara A it tha Cartaaiaa aa«a oatrix. g it tha Cartaaiaa aaatrifagal. Coriolia aad frictioa vector, ± it tha 
Cartaaiaa gravity loadlag vector, f ia tha foraa vaetor asartad by tha aad-affactor oa tha eavlroaoeat. aad [ 
ia tin gaaarallsad "virtaal" Cartaaiaa foraa vaetor applied to tha cad-ef factor, la tha poaitioa eabapaoa (T), 
equation (35) caa ba writtaa aa (17) 

Ad. X. *> ICt) ♦ B(g. X. p) X(t) ♦ C 0 (J, i, p) |(t) ♦ 5f<P> “ E y <t) (34) 

a 

where tha txi aatricea A, B, 0 o are highly ooaplea aoaiiaaar faaatioaa of X. X aad tha ayataa paraaatara p, £f 
rapreaeata tha dyaaaic coupling affect froa tha force loop iato the poaitioa loop which ia a fraction of tha 
foroa yctor f ia (Zl. aad £ y ia tha til force vector applied to tha cod-effector ia tha poaitioa sabapaoa (T). 
Eqaatioa (34) ia a aat of highly coaplas aoaiiaaar aad coaplad second-order diffaraatial equations. 


*-i Po» it tea C o atrol Stk. Bt 

n. Cittoln po.ttto. coatrol tek.a. it d.*«lop«d folly la t.fcr.ac. (111. For tko i.k. of coaplo toao.i, 
the result! are soooarixed ia thia lactioa. 

The linear adaptive position control law ia given by 

E y (t) - f(t) ♦ * p ( t >E p (t) ♦ tvil'ipi*) * C(t)g(t) ♦ B(t>i(t) ♦ A(t)|(t) '(37) 

e s shown in Figure 3. wfcerr g(t) i« the til reference fdeeired) position trajectory vaotor. Ep(t) - g(s) - J(t) 
i» tha fil position tracking-error vector. |(t) ie an anxillary signal, and (Zp£p ♦ LyEpI and (Cg ♦ Bg ♦Agl are 

the contr Ibnl ions dne to the feedback and feedforward controllers respectively. The required auxiliary signal 
aad controller gains are adapted according to tha following laws: 


t< «> * t<°> ♦ » x 

*p(*» - V<» * ”1 / 0 t i“>Ep<‘>'»‘ * ‘-jrftlg.Mt) 

V‘> • c.(o> ♦ ni JJj t i(t)Bp(t)4t ♦ <i 2 i(t)B^(t) 

C(t> - C(0> ♦ Mj j^*£<t )!’ ( t )dt * M 2 £(t>|'(t) 

B(t) • B(0) ♦ T , ♦ T 2 l(t)i-(t) 

At t ) - A(0) ♦ Aj jjj t {(t)|(t>dt ♦ X 2l (t)i(t) 
where tha 1x1 "weighted” poaitioa error vector g(t) ia defined as 


(38) 

(39) 

(40) 

(41) 

(42) 

(43) 
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(44) 


x<t) - fpBpd) ♦ w^U) 

la eqea t lone (38)-<43). ($}« eg, qj, M j, fj, Xj ) arc positive lealiri, (6j, v 2 . q 2 , p 2 , y 2 , l>2* • *• positive or 
taro scalars, and W p tad f T ara weighting matrices specified bp tba designer lo reflect tba relative 
significance ol tba position aad valoalty errors Bp Bp* Note that froa equations (38)-(43), tba eoatrollar 
galas aaa ba computed aslag tba mi adaptatloa "nodule" as ia Saotloa 4 (eqe. 33) shov* ia Figara 4. It la 
seen that tba poaltioa aoatrol scheme is extremely siapla slaaa tba aoatrollar galas ara obtaiaad froa 
aqaatioas (3IM43) by siapla iatagrstioa (saab os trspasoidal rala) sad tbas tba aoapatatioaal tlae rtqairad 
to evaluate tba poaltioa aoatrol lav (37) Is aatraaaly abort. Tbas. tba positioa aoatrol scheme aaa ba 
iaplaaaatad for oa-liaa aoatrol vitb bigb saapliag rataa (- 1 KBs), rasaltiag la Improved dy as a la par fora sac a. 


9, lybrld Force /Posit Ion Coatrol System 

Ia faatloas 3 aad 4. tba Cartaslaa cad-effector forces £ a and £- ara gaaaratad by tba force sad positioa 
coatrollers to aaaoapllsb force aad positioa traebiag ia (Z) aad (T) respectively, tiaca Cartesian forces 
eaaaet ba applied to tba end-cf factor la praatlaa. tbasa aad-af factor forces aast ba aappad late tba eqaivalcat 
Joist torqaas. Tbas. ia order to iapleacat tba force aad positioa coatrollers. tba aoatrol lav la Jolat space 
is gives by (IP) 

/e.uA 

I<t> • J*(l) ) <45 > 

\E y <‘>/ 

vbara £ is tba aal jolat eagle vector. X is tba asl Joist torqaa vector, aad J is tba aaa Jacobies matrix of 
tba ■aaipalstor. vitb appropriate raordariag of tba columns of J if accessary. 

It is iaportaat to aotc that sltboagb the force sad positioa coatrollers are separate ia the hybrid 
control architecture, there arista dyaaaic oross-ooapl lag froa tba force coatrol loop into tba positioa coatrol 
loop sad vice versa. This coapliag is due to tba fact that tba ead-affcctor dynamics la tba Cartesian space 
(X) is strongly cross-coapled; i.s. tba application of ead-effeetor force ia say direction affects the end- 
effector positions ia all directions. The cross-coapl lag affects ara modelled as "disturbance" terns and £f 
ia the force aad position coatrol loops. He adaptive coatrollers are capable of compensating for these 
disturbances aad maintaining a good tracking performance. Tba ability to cope vitb cross-coapl lag effect r in 
the hybrid coatrol architecture ia aa important feature of tba adaptive coatrol schemes of Sections 3 and 4. 


i. Discussion aad Conclusions 

Simple adaptive force aad position coatrol schemas for maaipnlators ia a hybrid coatrol architecture arc 
described ia this paper. The coatrol schemes are compatat loaal ly fast aad do not require the complex dynamic 
model or parameter values of the manipulator or the environment. The force aad position control loops are 
stable since the design is baaed ca the Lyapunov method which guarantees stability aa a by-product of the 
design. 

There are certain differences between the proposed approach and the conventional hybrid control of Raibert 
aad Craig [4). Firstly, ia the present approach, the force aad positioa coatrol problems are formulated ia the 
Cartesian space with the ead-effector Cartesian forces as the manipulated variables; whereas in [4], the 
problems are formulated ia the joint space. The proposed formulation results in computational improvement 
since inverse Jacobians are aot needed ia the coatrol loops. Secondly, in the proposed approach, the "task 
matrix" operates on the measured variables so as to produce the positioa aad force variables that need to be 
controlled; whereas in [4], a selection matrix and its complement are used after formation of track ing-e r ror s. 
The present approach seems more straightforward and appealing than the conventional approach. 

An attractive feature of the adaptive coatrollers designed la this paper ia their abilities to compensate 
for dynamic cros s-coupl lags that exist between the positioa aad force control loops in the hybrid control 
architecture. Furthermore, the adaptive force and positioa controllers have "learning capabilities" to cope 
with unpredictable changes in the manipulator or environment parameters such as the stiffness This is due to the 
fact that the controller gains are adapted rapidly on the basis of the manipulator performance. The low 
computational requirements make the proposed coatrol schemes suitable for implementation ia on-line hybrid 
coatrol with high sampling rates. 
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of Robotic Manipulators 
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i 

1 . Abstract 

t his pape r , the problem of position and force control 
compliant motion of the manipulators 1s considered. The external force and 
the position of the end-effector are related by a second order impedance 
function. The force control problem Is then translated Into a position 
control problem. For that, an adaptive controller Is de. gned to achieve 
the compliant motion. The design uses the Liapunov's direct method to 
derive the adaptation law. The stability of the process Is guaranteed from 
the Liapunov's stability theory. The controller does not require the 
knowledge of the system parameters for the Implementation, and hence Is easy 
for apr at ions. 

2. Introduction 


jj ip 5 O' 

for the 


While position control is appropriate when a manipulator Is following a trajectory through 
space, when any contact Is made between the end-effector and the manipulator’s environment, 
position control may not suffice. Precise control of manipulators, In the face of uncertainties 
and variations In their environments, is a prerequisite to feasible application of robot 
manipulators to complex handling and assembly problems, In industry and space* An important 
step toward achieving such control may be taken by providing manipulator hands with sensors that 
provide Information about the progress of interactions with the environment. Properly applied 
force control can reduce the positioning accuracy necessary to perform a given task accurately, 
and in fact make possible assembly tasks which would be otherwise impossible. 

The problem of posit Ion/ force control has attracted many researchers In the recont past 
years [1-5]. Among these works one can distinguish two different approaches. The first 
approach Is aimed at providing the user with a means of specifying and controlling forces and 
positions In a non-conf 1 1 ct 1 ng way, [1-3]. This Involves specification of a set of position 
controlled axes and an orthogonal set of force controlled axes. The second aporoach is aimed at 
developing a relationship between Interaction forces and manipulator positions, [4,53. This 
way, by controlling the manipulator position and specifying Its relationship to the Interaction 
forces, a designer can ensure that the manipulator will be able to maneuver In a constrained 
environment while maintaining appropriate contact forces. 

In the first group, Paul and Shimano [1] partition the cartesian space and find the best 
Joints to force servo to approximate the desired force and position commands. Ralbert and Craig 
[2] involve alt Joints in satisfying the cartesian position and force commands simultaneously. 
Whitney [31 arrives at a single loop velocity control scheme with the net effect of controlling 
the contact force. In that paper, the Impedance matrix approach establishes a connection 
between the two different approaches mentioned above. In all the above works, the structure of 
the controller depends on the kinematics and dynamics of the manipulator and of the environment. 
That Is, If the end-effector of a manipulator in motion encounters a point with new constraint, 
then the controller structure must be changed. In the second group, Salisbury [4] defines a 
linear static function that relates Interaction forces to end-effector position, by a stiffness 
matrix In a cartesian coordinate frame. Monitoring this relationship ensures that the 
manipulator will be able to maneuver successfully in a constrained environment. Kazerooni, et. 
al. [5] extend the previous work [4] and define a generalized mechanical impedance for the 
manipulator which 1s used for the compliant motion control. Their approach is an extended 
frequency icmain approach of Salisbury's stiffness control. Also, their design is stable and 
sho.'s robustness In the face of bounded uncertainties. In the second group approach, the 
co’itrol ler ' s structure does not depend on the kinematics and dynamics of the manipulator and 
that of the environment. However, In both groups, the controller requires the knowledge of the 
parameters of the system. 

In this work, the concept of mechanical Impedance, [4,5] is used in order to relate the 
external forces to the position and orientation of the end-effector. Hence, the problem of 
force control Is recasted in the position control problem. The objective Is to design a 
controller for the manipulator, so that the perturbed dynamic relationship for the overall 
system is given by a second order impedance function. For tha*- , a model reference adaptive 
controller is deslgred [6,7], where the desired impedance function is used to select the 
adaptive control model. The direct method of Liapunov Is used for the derivation of adaptation 
laws. This guarantees the stability of the overall system, 
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Consider a manipulator with n Joints, providing n degrees of freedom, 
of suoh manipulator la given by 


The dynamic equation 


M(q) q ♦ h(q,q) ♦ g(q) • T 


where q is the n-dimenalonal veotor of Joint angular posit Iona, q and q are, respectively , the 
vectors of Joint angular velocities and Joint angular accelerations, M(q) is the nxn symmetric, 

positive definite Inertia matrix or the manipulator, h(q,q) Is the n-dlmenslonal vector of 
Coriolis and centrifugal forces, g(q) is the n-dlmenslonal vector of gravitational forces, and T 
Is the n-dlmenslonal veotor of torque Inputs, applied to the manipulator. 

Let Aq be the perturbation of the Joint angular position vector q, from q # , and AT be the 
perturbation of the Input torques, from T t . Then the linearized dynamic equation la given by 

M(q a ) Aq • G(q a ) - AT (2) 

where, G(q # ) • [3g/3q,. . .3g/3q n ] for q - q«. 

The Joint input torques applied to the manipulator, the external forces on the end- 
effeotor, and the actuator torques are related by 

AT - LAT ♦ JV (3) 

a c 

where AT, AT and AF are n-dlmensional perturbations of the Joint input torques, the actuator 
torques and® the end-effector external forces, and is the Jacobian matrix which transform 
Joint angle coordinates to end-effector position and orientation. Also, ,the dynamic equation 
of actuators are approximately given by 


AT 

a 


A AT 


a a 


• v u 


where 


(4) 


A 


a 


B a 


diag [-1^ * ♦ • • 
diag [b, b n ] 


and AU is the n-dlmensional vector of actuator inputs, [5]. 


From equations (2), (3,) and (4), the dynamic equation of the manipulator and the actuator is 
given by 


AX - A AX ♦ B AU ♦ 0 AF (5)* 

Aq - C AX 

where 


r, T .* T iT «3n 
AX - [Aq , Aq ,AT a 3 e R 



C - [I 0 0]. 

and the pairs C A , B ) and (A,C) are respectively controllable and observable. 
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Modal lUfarenoe Adapt lv Control 


In this section, a eontrollsr la daalgnad ao that th# dynamic perturbation aquation of tha 
overall aloaed-loop manipulator system la given according to tha Inverse of the deeired 
lepedanoe. To achieve thla* a Model referenoe adaptive control strategy la employed. The 
reference model la choaen auoh that ita tranafer matrix la identical to the lnverae of the 
dealred mechanical lapedanoe. Henoe, the dynaalo equation of the reference model la given by 


- v x . ♦ V' 

«<J_ * CM 

m mm 


where 

A . 

c » 



( 6 ) 


such that 6X is the 3n-dlmensional incremental vector of model's Joints and actuotors values, 
6F la the n^diaenalonal vector of incremental external forces. The tranafer function matrix of 
the model la given by 

0.(9) • 4q (9>/«P(9) - C [sI-A ]*’b 
mm m in m 

such that the two dominant poles of the model are Riven by 

JC (s) - (Jo 2 • k,s • k ) -t (7) 

cm io 

where J la the Jacobian matrix, and J, k Q , and k, are reapectlvely the inertia matrix, 
stiffness matrix, and damping matrix of the desired mechanical Impedance, given by 

• (Js^ • k,a • k ft ), BY ■ model’ll spatial displacement . 
m i o is 

Let ua define the state error to be 

e • 4X -«X. (8) 

m 

Subtracting aquation (6) from (5), we get the dynamic equation of the 9tate error aa 


a • A e • (A -A)4X ♦ (B -D)«F - B4U. (9) 

mm m 

Let ua now choose the input torque to be 

5U - K x 6X • Kp6F • K e « 00) 

where K , K » K p are variable gain matrices with appropriate dimensions. Plugging <5U from (10) 
Into (9) x , we c gec 


e 


(A -BK )e ♦ ( A -A-BK )6X • (B -D-BK p )«F. 
me m x m r 


(U) 


The problem, now, is how to vary the feedback and the feedforward gain matrics, K %§ Kp and K e , 
such that equation (It) is stable and the state error e approaches zero, according to a 
prespecified transient behavior. 


To achieve perfect model following, the state error and Its derivative should become zero, 
that is e • e • 0. The conditions for perfect model following are given by 



0 

0 

0 


02) 


Furthermore, under perfect model following conditions in (12) the error equation (11) will 
become 


e * A e. (13) 

that is, the transient behavior of the state error is determined by the constant matrix A, which 
is defined by the designer and is Hurwltz. 
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5. Adaptation law 


Th« controller gains or the adaptive system should be adjusted such that the overall 
closed-loop system is stable and follows the reference model. The direct method of Liapunov may 
be chosen for determining the adaptation law, [6,7]* 

Let the corresponding Liapunov function for adaptation be given by 

V- 1 1 • 1 1 p* 1 1 B B "D-BKp 1 1 „♦ 1 1 1 1 s + 1 1 *-* B *BK t 1 1 M ««) 

where P,R, and S are 3n x 3n arbitrary positive definite symmetric oonstant matrices. Also, the 
quadratic norm for any matrix F and any positive definite symmetric oonstant matrix 0 is defined 
by 

||F|| 0 - tr[F T GF3, where tr • trace* 

The function V is positive definite, exoept when there is a perfect model matching it becomes 
zero. Differentiating V, we get 

V - . T (PA*AP). _ _ d (,5) 

♦2tr{(B -D-BK_)‘[P. 4F l ♦ R — <B -D-BK,)] 

" F dt “ F 

♦2tr! (A -A-BK.) T [P. 4X T ♦ S - (A-A-BK )] 

* x dt " X 

♦2tP l ( A-A *BK ) T [P« « T ♦ M - (A -BK -A)]. 

" * dt " 6 

Also notice that, slnoe matrix A is Hurwltz, then for any given posltlvo definite symmetric 
matrix Q there exists a positive definite symmetric matrix P such that 

P A • A T P - -Q 

Now, for the stability, V should be negative. One way to satisfy this is to choose 

* t -1 T 
Kp - B t R Pe4F 

K - bV’p.4X T (16) 

X 

* t -1 T 
- -B t m Pe e 

wnere B^ - [0,0, B \ is the pseudo-inverse of B. However, since R, S and M are arbitrary 
matrices, we can cftoose them such that R8 - ol, SB - 81 and MB • YI, where a,8,Y are positive 
scalars. Then denoting E - [0,0,1], the ldaptatlon a laws can be liven by 

Kp - oEPe6F T 

* T 

K - BEPedX 1 (17) 

x 

K - -YEPee 7 
e 

With these adaptation laws, the 'derivative of the Liapunov function, V, is given by 

V - -Il«llg - -e T 0e<0 

which Is negative for non-zero state error, (i.e. e*0). This guarantees the asymptotic 
stability of the equilibrium point, e - 0. 

The proposed design adaptively controls both the position and the end-effector force, and 
Is appropriate for compliant motion of the robotic manipulators. The proposed adaptive 
controller is shown in Figure 1. 

Moreover, if the spatial displacement and velocity can be directly measured, then the 
knowledge of Is not necessary for the implementation of the adaptive controller. 
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6. Conclusions 


In this paper, ths definition or asohanlosl impedance ussd In [4,5], is •■ployed. The 
sxtsrnsl fores and the position of ths snd-ef factor art rslatsd by a second order impedance 
function. The force control problen Is then translated to a position control problea. An 
adaptive controller la designed for the latter problea to achieve the compliant motion for the 
manipulator. The design uses the Liapunov’s direot aethod to derive the adaptation law. The 
stability of the process Is guaranteed froa the Liapunov’s stability theory. The aajor 
advantage of this aethod la that the controller does not depend on the knowledge of the systee 
paraaeters and those of the envlronaent. It uses the aeasured forces at the end-effector and 
the position and velocity of the end-effecto r in the Joint space. The controller Is s lapis and 
can be easily lapleaented by saall computers. 
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Figure 1 . Adaptive Position/Force Controller for Robot 
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JThis pwwiiUibm deaciib— Jpodeling and controKfesifen for flexible manipulator*, both 
from an experimental and analytical Wewpoint/From the aplication perspective, w+apor* 
0 ongoing effort within the laboratory environment at The Ohio State University, 
where experimentation on a single link flexible arm is under w ayc^Se ver al unique feature* 
of this study are described here. First, the manipulator arm is slewed by a direct drive 
dc motor ami has a rigid counterbalance appendage. Current experimentation is from two 
viewpoints: 1) rigid body slewing and vibration control via actuation with the hub motor, 
and 21 vibration suppression through the use of structure-mounted proof-mass actuation 
at the tip. Such an application to manipulator control is of interest particularly in design 
of space-based telerobotic control systems, but has received little attention to date. From 
an analytical viewpoint, parameter estimation techniques within the closed- 

loop for >e|f tuning adaptive control approaches/^TttiRrinrrodiiced is a control approach 
based on output feedbacks and frequency weighting to counteract effects of spillover in 
reduced -order model design. A model of the flexible manipulator based on experimental 
measurements is evaluated for such estimation and control approaches. 
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1. Introduction 

Traditionally, robotic manipulator arms have been modeled as being composed of rigid links, with co-iocated actuators and sensors, 
towards the goal of ensuring stable and reliable control. In order for typical manipulator arms to maintain this rigid property as 
modeled while carrying payloads, the mechanical design requires large and massive links. This in turn dictates that the torques 
applied by the joint actuators be large, and heavy, usually geared motors are needed for actuation. Moreover, the controller for 
such a system is forced to move the arm slowly and deliberately so as to prevent any swaying or vibrations. 

In recent years there has been muck interest in using light-weight, higher performance arms for both commercial and space-baaed 
applications, leading to the study of flexible manipulator control. The advantages of flexible robotic manipulators are many, 
including faster system response and lower energy consumption, smaller actuators and overall trimmer mechanical design, reduced 
nonlinearity effects due to elimination of gearing, less overall mass and generally less cost. Obvious tradeoffs, however, complicate 
the issue of flexible manipulator control, primarily centering on the design of controllers to compensate for, or to be robust in 
the pre>»*iirr of flexure effects. With the advent of advanced rornputationai resources, it rules are currently being made towards 
solution of the many problems associated with control design. 

Control design for lightweight flexible manipulator arms has gained the attention of control theorists only recently, and several 
approaches have emerged. Most prominant are approaches which either linearize and truncate for controller design, or solve the 
nonlinear robotics problem for rigid link motion control and treat the flexible dynamics separately. For example, the problem of 
observation spillover and truncation error effects is treated in llj, wherr in simulation studies a linear feedback scheme around 
a reduerd order model is introduced for a single-link manipulator. In 2! control of the rigid motion is accomplished via state 
feedback linearization whereas vibrational dynamics are treated as disturbance effects. Several other analyses have appeared along 
these basi. lines, using various approaches From an applications viewpoint, however, only a few studies have been 

documented for parameter estimation, system identification, ami control. Most prominant among these are the works of book. * t 
ai X.'MO. I l for time optimal slew experiments, related studies at JPL in flexible beam control T 2.131, Schmitz and Canon Ml 
using non -colocated and tip position sensing in the control algorithm, and several studies conducted at NASA LaRC IVlfij. 

In this presentation we report on progress made to date on modeling and control design for flexible manipulators, both from an 
experimental and analytical viewpoint. Specifically, we discuss the ongoing effort within the Control Research Laboratory at The 
Ohio State University, where experimentation on a single link flexible arm is underway. The manipulator arm is slewed by a direct 
drive dc motor and lias a rigid counterbalance appendage. Current experimentation is from two viewpoints: l ) rigid body slewing 
and vibration control via actuation with the hub motor, and 2) vibration suppression through the use of structure-mounted proof 
mass actuation at the rip. Real-time parameter estimation techniques, within the closed-loop for self-tuning adaptive control, is 
under investigation amt is described briefly here. In these initial studies, a model of the flexible manipulator based on experimental 
measurements is evaluated. 
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3* ExptriimnUl Setup 

I./ 4pparct«j 


Within the Control Research Laboratory at The Ohio Strte University, several experimental configurations are under study 
tor system identification and slewing and vibration control for flexible mechanical structures. In this presentation we fo ewe on 
experimentation and simulation analysis with a single link flexible arm, depicted in Figure 1. The arm is made of 0.0625 inch 


+ 


i 



Figure 1: OSU Flexible Arm 


thick aluminum and is counterbalanced with a rigid aluminum appendage with mass equal to that of the arm. Hub actuation is 
accomplished by a 3.4 ft-lb dirtet drive dc f^otor which has an optical encoder with a quadrature digital output to sense motor shaft 
position, and a tachometer to measure motor shaft speed. This, then, allows both hub position and velocity feedback for control. 
Strain gauges (for monitoring and parameter estimation) and an accelerometer are placed along the arm, and a 512-elemen* C1D 
linear array camera with RS-422 interface is used for sensing the tip position by observing the lamp fixed to the tip of arm. With 
such a scheme, the tip sensing mechanism (camera) is utilized in verification and tuning of the predicted endpoint position. A 
related objective for this setup is to achieve control without camera information feedback, with for example rate and acceleration 
sensing feedback, for application in space-based manipulator systems where off-structure reference for sensing is impractical. 

Some characteristics of the arm are given below. 


Table 1: Arm Characteristics 


! Material 6061 -T6 Aluminum 

I ; 

| Modulous of Elasticity 68.944 * 10* N/m* 

! Cross Sectional Area Moment of Inertia 

Flexible Arm 3.350 <l0* n m 4 

Rigid Appendage 2.427 <l0‘*m 4 


Lengths 

Flexible Arm 1.0 m 

Rigid Appendage 0.381 m 


The unique features of the structure are the direct drive mechanism, chosen to minimize effects of backlash and other nonlinearities 
due to gearing, and the counterbalance appendage, which provides a more realistic model of application -oriented structures. Such 
a hybrid structure does, on the other hand, pose unique problems for analytical modeling. 

Two computing environments are available in the laboratory for real-time control and data acquisition. The first such system is an 
IBM AT which uses different combinations of several custom-built cards in addition to the A/D equipment. These cards include a 
controller for the slewing motor with electronics utilized in processing data received from the linear-array camera. Another card, 
designed in-house, processes strain gauge and accelerometer data, and includes a low-noise, high-g&in amplifier with a low pass 
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filter and excitation to the bridge circuitry. A third custom board is used to drive the proof-maas actuators (discussed in |17] and 
later in this presentation). The board receives an analog voltage from the D/A and amplifies it to a current which is adequate to 
drive the actuators. The linear array camera is interfaced to the computer using a custom board which converts the camera's serial 
data stream to a number corresponding to the position of the beam endpoint. The second computing system is the MicroVax II, 
equipped with commercially available A/D-D/A boards and real-time operating system software. For the study presented here, 
the data acquisition is carried out using the IBM AT due to the availability of the camera interface electronics. 

S.S Modeling and Frequency Response 

For purposes of finite element modeling studies, the arm is analysed iu two separate components, those being the flexible arm 
itself and the rigid counter balance. The cylindrical mass at the end of the rigid component is modeled as a point mass, and 
the two components are connected at the pinned joint (motor shaft). For the FEM analysis, each component is modeled as a 
two-dimensional elastic beam, and the software package ANSYS (18| was used to generate the first five modes of the system 
shown in Table 2 below. We note that torsional modes were assumed to be insignificant, and were therefore neglected in the 
analysis. The principle advantage to modeling the arm in this manner is that the effects of the counter balance in the static 
characteristics are included. Several other approaches were utilised, such as considering the joint at the motor shaft to be a fixed 
point (clamped free), negating any effect the counter balance may have on the beam dynamics. Experimental results (described 
below) indicate that the former approach, described above, gives the closest match to measured responses. 

For purposes of comparison, several experiments were conducted in testing response characteristics of the apparatus. An open 
loop frequency response was found by applying a sinusoidal system input torque (varying the motor current), and recording 
measurement* of the tip position; the procedure is similar to that employed in 14/, Data was taken over the range 0.2 Hx to 13.0 
Hz. ill step* of 0.1 Hz. and the results are shown in Table 3. The system poles and zeros were found hy noting the frequencies 
which produced maximum and minimum tip deflection, respectively. An inherent assumption in this technique is that the damping 
of the beam is very *mall (this fact was experimentally verified in an independent study 19|). The damping ratio calculations 
represented in the table are based on the assumption that excitation near a modal frequency will result in the response showing 
primarily only that particular modal frequency. 


Table 2: FEM Result* 

Mode Frequency (Hertz) Table 3: Frequency Response Data 


1 

2.0091 

Minimum Tip Response 

Maximum Tip Response 

Damping Ratio 

- 

8.2509 

t 0 Hz 

1.2 Hz 

0.139 


23.1187 

10.3 Hz 

7.ti Hz 

0.050 

1 

W.5A77 

1 1 3 Hz 

12.0 Hz 

0.008 


795244 





The open loop step response (in position) of the arm was found by rotating the motor shaft through an angle of 10 degrees and 
measuring the tip deflection from its nominal value I initial point). After this maneuver the motor holds the new position (that 
is. is servoing ) since the local feedback loop is active. Figure 2 illustrates a plot of the step response. Note that while the torque 
is applied at the hnb at time t - 0, the tip deflection response is delayed by approximately 30 milliseconds and. in fact, initially 
moves in the direction opposite that of the hub rotation. The step responses indicates a settling time of about one minute. A 
fast Fourier transform of the data allows ,ear identification of the first two modal frequencies; these mrur at 1.18 Hz and 7.5 
Hz. respectively. Figure .1 ■■how* the result of the FFT for the tip position in the step response test. We note that the rigid body 
mode * d< component | flue to the pinned joint has been subtracted out of the FFT plot for clarity. 

3. Control Analysis 

./. / Prohit m hormulatnm 


( otiMder again t lie Miigle link flexible manipulator system described above, redrawn in Figure 4. The displacement of any point 
along t he arm is given by the hub angle tf(/| and the deflection o(«r.O measured from the line ox. We assume that only transverse 
vibration present and that the deflection due to this vibration is small. Let L be the arm length so that in general terms 

yt-r.M = . I 1 1 


Ax 2 


Elix) 


iP'tfx.t) 


dx 2 




tP'Mx. t ) 


( 2 ) 
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STEP RESP0NSE 


FFT 0F TIP P0SITI0N 



t. «> it. It. «. It. It. M. 4t 

TIME (X 0. 005 5EC0NQS) 
Figure 2 


t.t it. «. n. 4t. tt. tt. m, 

FREQUENCY (X O.OQ8 HZ) 


11 Figure 3 

where £7(x) is the elastic stiffness, .4 is the cross -sectional area, and in the mass density. For the mechanic a! configuration 
under consideration, (2) must satisfy the boundary conditions 


O(0, M :fl : £7(x); 


Si ~T-I„i = 0 ; £/(x»S i =0 : £/(x>S! 


where T is the torque at the hub and l» is the actuator inertia. Accordingly, (2) may be put into the familiar form for the 
generalized modal coordinates q[t) as 

A/q - 0q ► A‘q = Bf . q - [fj.f, f«! r . (4) 

where M is the mass matrix, k is the stiffness matrix, ami D contains terms associated with the damping. For position and 
velocity measurement* (in the y direction) the solution to (4) is approximated by 

tjir.n - V 1 5) 

for the eigenfunctions iv*h including the rigid Inxiy mode at k - 0, and the I are chosen to tiuniuuze the mean-square error 
upon Mibxtitntion into f 1 i. 


f nder Hi** Miularv t raii'iformatnoi then, w# use the notation q now to represent the state in the state variable formulation of 


q 0 I 1 [-I 0 q 

.qj'l-ll* -♦ r 0*|[q] ‘[♦ r fl| / • ^ [ q ] ' (6) 

where \ l * r /v •!> is .in i • /» diagonal matrix containing the squares of the modal frequencies along the diagonal. 

and h *i i> if.e me.toiirement vertor 1’pon o’ i.-rangement . we may write ibj :n the manner 


0 1 



0 1 </„ 

~ ~ “** *»•*’* . . ‘/n 


We note that in l7l the r.gid body mode has been included. Since the only control input for this example is the torque, then 
/ f Finally, the matrices C and + r B are given by 


L 0 fi (/.i 
0 1 0 
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S.t PoromtUr Estimation 


The fundamental Imm in the anthtnutied formulation of flexible mechanical street ire* lies in the fact that sock distributed 
parameter systems must be identified (controlled) with only a limited number of sensors (actuators). Intend, for the analysis of 
the single-link flexible arm we typically consider hub actuation only, and tip position and/or hub velocity measurements to be 
employed in modeling and feedback control. Moreover, without reliable models for the control design the analysis becomes even 
more difficult. Philosophically, there are several different views to take in the control design. One approach is to construct a 
controller that is reasonably robust in the presence of modeling uncertainties and spillover, and yet simple enough in structure 
to be easily implement able (for example the variable structure control approach (20) ). Another approach is to perform system 
identification exercises to model the system as accurately as possible prior to control design. A third approach is a combination 
of tbe first two: estimate the system parameters on-line (in the closed loop) and base the control design on the resulting model. 
This last viewpoint is often refered to as Self-Tuning Adaptive Control (STAC). 

In tbe STAC approach, the manipulator dynamics are represented by linear discrete- time models, affording the primary advantage 
that the controller design is inherently digital in nature. In the application to flexible structures, tuning parameters include 
combinations of the damping and modal frequencies, or some combination of other free parameters which make up the manipulator 
model. Our approach to the parameter estimation problem involves recursive least squares methods with covariance resetting. 
That is, in order to maintain a fast overall convergence rate, the covariance of the estimates is reset at regular intervals in the 
algorithm. Such a scheme is particularly attractive for the manipulator control problem due to the time-varying nature of the 
tuning parameters during slew maneuvers and varying payload exercises. Experimental studies of the parameter estimation and 
STAC approach for the arm described above are presently underway. In the following we present simulation results which indicate 
avenues to pursue regarding implementation. 
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Figure 5: Parameter Estimation Simulation Results 
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The element* of the C matrix can be found according to [14) 




and the )(0) are solved from a system of nonlinear equations. The modal frequencies can be computed a priori, or identified 

as discussed in the previous section. We model the system as a stochastic ARMA process and excite a fourth-order model of the 
arm with a white noise input. Such a representation allows a delay (in tip position response, as observed in the actual system) 
to be inserted into the model. Using a zero-order bold circuit in the model and sampling the simulated FEM model, the AR and 
MA parameters converge to their nominal values as depicted in the sample plots of Figure 5, which shows time histories of one 
AR parameter and one MA parameter. Values for damping coefficients are then calculated from these parameters. These results 
are not useful in closed-loop control however, due to the length of time for convergence to the true parameters. Note also that a 
primary difficulty results because of the approximate pole-sero cancellation in the system model (indicated by the spike at about 
0.7 seconds). Slightly better results are obtained if the rigid mode is removed from the model, which corresponds to exciting the 
unforerd system with an initial disturbance. Such an exercise is possible since the motor inertia is considerably greater than that 
of the arm. 

Prior to actual experimentation on the arm, several modifications must be investigated. For example, simulation studies for this 
and o.lier example systems have indicated improvement for different resetting intervals; for details, the reader may wish to consult 
[21 1 . Also, simple digital low-pass filtering of the measured variables has produced improved performance of the parameter 
estimator. For control purposes the simulations have shown that an algorithm which turns on the control after allowing the 
estimator to run for a short period of time (for example, as illustrated in the simulations, about 1 to 1.5 seconds) will achieve 
the control objective. However, we are presently pursuing ways of improving the time to convergence in the closed loop with 
approaches using state feedback. 


i.J Output Ftnlbtick and Frequtncy Shaping 

Generally speaking, high dimensionality and multiplicity of inputs in large-scale systems such as flexible mechanical structures 
leads to complex centralized controller schemes. One solution to this problem is to simplify the structure of the model via decom* 
position into subsystems with associated subcontrollers in a decentralised output feedback formulation. Moreover, centralized or 
decentralized output feedback is one of the more straightforward algorithms, from the viewpoint of implementation, for the control 
of flexible mechanical structures; see, for example, [22,23,20|. 

For the problem of single-link flexible manipulator control, where only hub actuation is employed in the control action, the output 
feedback approach to controller implementation is centralized in nature. The problem of spillover is, however, a critical issue to 
consider in the design. In order to minimize the effects of spillover, we consider a frequency-shaped cost functional (24) v where 
penalties aie assigned to the truncated modes and high penalties are assigned to the high harmonics at the input in order to 
minimize the effects due to excitation of the residual modes. 

We consider the cost functional J to be minimized as formulated in the frequency domain utilizing Parseval’s Theorem. With 
infinite time horizon, such a cost is written in the manner 


J = r ' X*iju/)Q{jut)X{jut) ¥ T*(ju/)R(ju/)T(juf)\duf , 

J - no 


where corresponds to the system state fas in (fi)) and T{jut) the input (torque) of the system. For implementation of such 

a scheme. consider the diagram of Figure 6, where the parameters ATf, A'j, A' a are solved for in the minimization of (10), and the 
filter pole location I * I is dependent on the system dynamics. In the example under consideration, jut) is the system matrix and 

Hi ~ t yw *■ If - - 7 ). 

Under this formulation, the open -loop state variable representation of the system has the form 
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Figure 0: Output Feedback Scheme 
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where the new measurement v(f) now includes the torque T. Incorporating the system of (11)-(12) into the output feedback 
structure of Figure 6 (and subsequent solution of the corresponding Lyapunov equation) allows the off-line calculation of the 
feedback gains from minimization of ( 10 ) by an appropriate nonlinear optimisation routine. 

We consider now a simulation of the flexible arm system, using a five-mode model from the FEM as the “truth model”, from which 
measurements are taken and fed back in the output feedback scheme. The controller design for this example is based on the reduced 
system of rigid mode plus first flexible mode, and the resulting control is then tested against the full-order truth model to illustrate 
the effects of the frequency weighting approach in reducing spillover. A conjugate gradient method is used in the optimization 
portion of the design, and the final values obtained for thecontrollaw (with 7 =* 4) are A' t = -110.09, A'j = -111.53, A'j = -88.83 
(feasible values for the system under consideration) for the cost which reached a minimum after approximately 3000 iterations. 
The results using this controller are illustrated in Figure 7 for a step input torque; this applied input is such that the tip rotates 
through a small angle (of less than 5*, in terms of the rigid position). The values for torque begin at zero, that is, the dc component 
is subtracted out. In the simulation, the response settles in about ten seconds, whereas the free response decays after about one 
minute due to damping included in the model. 

4. Structure-Mounted Proof-Mass Actuation 

Sirce t lit* large-angle slewing problem is complicated due to the flexibility effects inherent in the structure to be slewed, one is 
naturally led to investigate the possibility of relegating , at least partially, the task of vibration damping to a separate sensor- 
actuator pair ami associated feedback loop. To this end, we have been investigating utilization of a structure-mounted momentum- 
exchange device mounted near the tip of the single-link manipulator. The practicality of such active vibration damping in a 
robotic environment i> closely coupled to the availability of lightweight and effective devices. The device we have considered in 
our preliminary studies is a proof-mass actuator developed in our labs to study active vibration in space based flexible mechanical 

structures. 

Non-ground referenced linear actuators are not yet widely available on the market, and this fact led to an in-house development; 
a general view of the device as mounted on the arm is shown in Figure 8 . The device is built around a linear motor manufactured 
by the Kimco division of BEI Motion Systems which has a total mass of 25 grams, and can deliver a peak force of 2.2 N. The 
coil (solenoid) is rigidly mounted to a beam clamp which fixes the actuator to the arm. Also connected to the clamp is a rigid 
aluminum bracket which supports the springs. The proof mass is coupled to the framework through the springs, which are in 
turn coupled to the framework with adjusting screws so that their rest tension and proof mass rest position can be controlled. 
There is sufficient adjustment so that springs of different length and stiffness constant can be accomodated. The springs provide 
a restoring force for the proof mass and transfer force to the structure. A hanger was also mounted to provide strain relief for the 
feed wires. 

The proof mass consists of a rectangular steel ring with a central steel member. This central member passes through the coil and 
restricts motion to a single axis. Samarium cobalt magnets are fixed to the top and bottom inside edg?s of the ring (adjacent to 
the coil) so that the interaction of the permanent magnetic field with a current in the coil results in a force on the proof mass. 

Details ot the development of the dynamic model of the aciuator may be found in (17|. The net force applied to the tip of the 
arm (where the actuator is mounted) may be given as 

/ = 2k: - A>i. + Bz , (13) 
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Figure 7: Output Feedback Example 



mz B: + 2Jtz * Kyi + my , (14) 

where m is the proof mass, y the displacement of the structure at the point of actuator attachment, / the force acting at that 
point, z the relative displacement of the proof mass, B the viscous damping coefficient, Ky the motor force constant, and i, », are 
the input and armature currents, respectively. The actuator constants taken from the data sheets which accompany the individual 
components, are m = 0.0207 kg, k = 262.7 N-m' 1 , A> = 1.112 N-ampere* 1 . The incorporation of the above actuator creates a 
second feedback loop to which the task of vibration damping is relegated. The two control loops (for slewing and for vibration 
damping) can be both designed and implemented in a decentralised manner. Note that y includes the displacement due to both 
the rigid body mode and the flexibility (see (l)). The principle of relegation implies that we design the feedback control only for 
the latter portion. To this end consider a vibration damping loop for only the first mode, such that the relevant expression is 

+ wjqt = b\f , (15) 

where *f\ is the natural frequency of the first mode and b x is an influence factor determined from the mode shape at the point of 
interest. Acceleration feedback can be used from the co-located accelerometer and a simple PI controller has been designed. It 
is evident, however, that the STAC approach or the frequency weighted control approach outlined earlier, can also be used here. 
The incorporation of the more sophisticated design approaches resulting in more complicated controllers will aid in handling more 
than the first vibrational mode. Studies along this direction are presently continuing. 


5. Conclusion 


In this workshop presentation we have described work in progress on modeling, parameter estimation, and control studies for 
an experimental, one-meter single-link flexible manipulator arm. Models have been developed for the apparatus based on finite 
element analysis and experimental verification. These, with the closed-loop parameter estimation procedures described here, and 
subsequent STAC approach for control, are being evaluated on the laboratory arm. 

Under investigation is experimentation involving local proof-mass actuation for vibration control at the tip of the arm, using a 
device developed in the Control Research Laboratory at Ohio State for flexible structures control work. The output feedback 
frequency shaping approach described here may be easily extended to this application, where the formulation is </rcen/ra/i;ed in 
nature; results on this technique for general flexible structure vibration control will appear in (25|. Finally, various other centralized 
(for tin* case of \iil> actuation only) and decentralized approaches are currently being evaluated in the laboratory. 
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Abstract 


The nerd for dual arm robots in space station assembly and satellite maintainance is 
of increasing significance. Such robots will be in greater demand in the future when 
numerous tasks will be assigned to them to relieve the direct intervention of humans in 
space. Technological demands from these robots will be high. They will be expected to 
perform high speed tasks with a certain degree of autonomy. Various levels of sensing will 
have to he used in a sophisticated control scheme. 

(n this pmisentattnn m% mill briefly. Jw.tihe ingoing research in control, sensing and real- 
time software to produce a two-arm robotic system that can accomplish generic assembly 
Thr prr*r -ill '-rni— T*rVT unfitly ~~ the control hierarchy, the specific co ntroL 
approac heeler ted being the Variable Structure gliding Mode) Control approach/ W s ' wilU- 
coasidar a decentralized implementation of mod el -reference adaptive control using Variable y 

Struct urr^cont toilers and the incorporation of tactile feedback into it«*r3 «, s * 




2. Introduction 


'-of' 


The need for dual arm robots in space station assembly and satellite maintainance is of increasing significance. Such robots 
will be in greater demand in the future when numerous tasks will be assigned to them to relieve the direct intervention of 
humans in space. Technological demands from these robots will be high. They will be expected to perform high speed tasks 
with a certain degree of autonomy. Various levels of sensing will have to be used in a sophisticated control scheme. 

In this presentation we will briefly describe ongoing research in control, sensing and real-time software to produce a two-arm 
robotic system that can accomplish generic assembly tasks. The paper will concentrate mostly on the » ?.troi hierarchy, 
the specific control approach selected being the Variable Structure (Sliding Mode) Control approach. We will consider a 
decentralized implementation of model-reference adaptive control using Variable Structure controllers and the incorporation 
of tactile feedback into it. 

We assume that multi-arm robotic operations have a hierarchical/decentralized control structure. However, the appropriate 
control algorithms have to be chosen for feedback to properly fit the special hierarchy of multiple robots with dextrous end 
effectors. A specific control approach has to be selected, and its requirements can be clearly specified: 

• It must easily decompose into a hierarchy. 

• It must be ameanable for modular implementation. 

• It must posrss low real-time computation requirements. 

• It must be able to receive changes from sensor data. 

• It must be insensitive to modeling errors and load variations. 

It is expected that robotic systems will become an important part of future space missions. Orbital Maneuvering Vehicles 
have been proposed with dual arm systems for space station assembly, satellite servicing, etc.. Although the importance of 
dual arm robotic systems have been recognized for some time, little work of a general nature has been done in controlling 
such systems. 

Early multi-processor robotic controllers were based on the principle of a simpler low-level processor and a more sophisticated 
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high-level computer. This made interfacing fairly difficult and expansion almost impossible. With today's processors and 
appropriate software load distribution, tasks at ail levels can be handled by processors of the same family. Coordination of 
data transfers are extremely simplified. It is apparent that certain improvements will have to be made over conventional 
control structures (as used say, in the PUMA) if there is hope of accomplishing sophisticated assembly type operations using 
multiple manipulators. For versatile performance the control hierarchy will exhibit a finer task decomposition. Tasks will 
have to be relegated to a large number of processors. Sensory inputs will have to be appropriately assigned. 

The control of robots in a precise, reliable and repeatable manner is by itself a hard problem. The problem becomes 
somewhat more complicated when considering the control of coordinated robot arms. Limited work has been done in the 
area of multi-arm robot systems (1) (2), (3), (4j, [5|, (6] and [7]. 

A method developed for controlling manipulator arms by Young (8), 6sguner and co workers [9], [10], and others [11] 
utilising variable structure control theory is particularly ameanable to extension to multiple arm systems controlled within 
a hierarchical framework. Initial work along these lines have already been performed. In this paper we will be reporting on 
recent developments in the above approach and especially tactile sensing feedback from the end effector as included in the 
hierarchical control structure. 

There appear to be certain generic tasks that are imbedded in many assembly and maintai nance operations. These include: 


e Pick and place type tasks, 
e Pin in the hole type tasks, 
e Combined rotation-translation type tasks. 


Many complex operations can be partitioned into combinations of these generic tasks. Thus the control algorithm design 
and related software will concentrate on the above tasks. 

Figure 1 summarizes the control hierarchy to be used. At Level I, parsing interpreting and decoding user commands and high 
level sensory input are accomplished. Error messages to the user are also generated at this level. Level II includes trajectory 
planning, associated coordinate system transforms and analysis of bounds of the workspace. Joint-level coordination and 
transfer of information required by control algorithms is carried out at Level III. At Level IV, generation of the feedback 
control and I/O with actuators and force sensing is accomplished. .The control algorithm selected has to be strongly coupled to 
the information structure selected. The algorithm must be decomposable into the hierarchy imposed and inherently adaptive 
to load and trajectory variations. The algorithm/control approach we are utilizing is the Decentralized Model Reference 
Adaptive approach using Variable Structure (sliding mode) controllers. It appears that this algorithm with appropriate 
modifications to accomodate sensory input and user commands can be mapped onto a multiprocessor system. 



CONfPCX/fKDBACK At 
ASSOCIATED MO IOC 


CONtROl/fCEDAACK AT 
ASSOCIATED MOtOP 


Figure 1: The General Hierarchy for the Control of Two Manipulators 
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Various results hare been recently reported in the utilisation of sensory information from the end-effector in the feedback 
control structure. In the present work we will be using the force feedback approach (tactile sensing) as reported ia [12). The 
incorporation of force feedback into the control algorithm is not straight-forward, and in the next section we will introduce 
the concept of inierucften compensation which will aid in the analysis. 

3. The Concept of Interaction Compensation 

It has been previously claimed that the levels of a hierarchy (as in the multi-manipulator system of Figure 1), can he classified 
so that one identifies "increasing intelligence with decreasing precision", as one moves up [13]. As with most labeling schemes, 
this may be an over-generalisation and there may be numerous cases where proper rtlegation of (a) Control Authority, and 
(b) Information Distribution, may result in preferrable operation of the over-all system* 

We will consider the regulation of an interconnected system to introduce the concept of Interaction Compensation, which 
we will subsequently apply to the specific case of multi-manipulator control; under a fixed Control Authority structure and 
control algorithm., 

Consider a large-scale system consisting of N interconnected subsystems each defined by 

*. a A, (**)*, 4 4- E t (x) (l) 

V« a D t x , , (2) 

for t a 1,2,... , W, where x< t ti, c where ft represents the real fiuclidean vector space, x T » (*f,xj\...,x$) 

and the matrices are of compatible dimension. £,(*) denotes the totality of interaction effects from the remaining subsystems 
to subsystem i. Note that E,(x) may also include modeling errors. 

Let us first define what is meant by insensitivity to intend' jn. Consider again (l)-(2), rewritten for brevity as 

ii a /(x,,u,) 4* A(*,0 , (3) 

for the state transition mapping / : R" 4 x — > IF** and the total interaction term A(x,t). The system (3) is said to be 

insensitive to interaction effects if the solution x,(t) may be expressed as 

*.(0 = i,(t) 4- 0{£ t ) , 

where i f (t) solves x, = u t ) f for all e, > 0 and all t > T, for some finite time 7\ and C?(e,) represent* terms of degree two 
or higher in c,. 

In reference to measurements available for use at the control inputs of subsystem i we can now consider three possibilities: 

1. Full (real-time) interaction information. 

2. Partial interaction information. 

3. Interaction modeling. 

It can be shown that interaction information provides the opportunity of directly negating all effects within the range space 
of B % . The more interesting cases are when partial information is available or can be genented through dynamic modeling of 
the interactions. 

Given the model, the decentralized control problem is to design a controller to feedback locally available real-time information 
such that the states of each local subsystem are regulated to zero or track the states of a local reference model. 

Let the local reference model for the i-th subsystem be given as 

. x, = .4,x t 4 B,r, (4) 

y» — i (5) 

with x, t #"■, .4, t ti’ 1 '**', B % t tf"** 1 , where r, is a scalar reference input. Define the local error between the ith subsystem 
and its local reference model in the manner 


e, = i, - x, (6) 

so that the local error system dynamics may be written in the form 

e, * A % e t + (/!, — Ai)Xi 4 Bir t - B t u t - £,(x) . (7) 
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Within this framework, uiumc that 

• Each local controller design is dependent only on the local model. 

• The systems (l)-(2) and (4H&) controllable, 

• The states *< and are measureable locally for feedback to the tth input. 

• Reference trajectory information may be fed to a subsystem from a higher level coordinator but interaction information 
is only partially available in real-time , and the subsystem is not allowed to communicate with the other local controllers 


4. Variable Structure Controllers 

In this presentation we are going to assume that the basics of Variable Structure Control are known. An important feature* of 
Variable Structure Controllers is the fact that, for the decentralised case, the local subsystem is made insensitive not only to 
local parameter changes but also to dynamical interactions with neighboring subsystems once the sliding surface is reached. 

We define the sliding surface corresponding to the system (l)-(2) as 

. ( 8 ) 

for Ci in ft 1 "”'. Invariance properties of the sliding surface were given in (14,15). Refering back now to the error system (7), 
the control law is formulated in the manner 

U| = A # , t x + +■ A, ( fj + » (9) 

where K, t in ft 1 , A', ( in ft 1 ”"*, and A",, in R lM "' can be specified in different regions of the state space, and where 4, is usually 
a constant picked according to the norm of the interactions. The elements A*,., A',.i A\ t , and 4, are discontinuous functions 
of the sliding surface and the coefficients of system and reference model state equations. We furthermore claim that estimates 
for 4, can he refined with knowledge on interactions. In the following we derive the appropriate forms based on the reaching 
condition which, in view of (7), (8) and (9), becomes 

= c,(,i. - " A % ] — Bih ak )x t <r x 

4 - C,[B, - - C,{£,4, + £,(*)}*< 

< 0 ( 10 ) 


The condition (10) is satisfied provided that the gain parameters and are chosen so aa to make each term negative. Thus, 


(AV), = a„(C*r l Ji;cX < Jsgn((e l )^ ( ] ; (11) 

jjjjejlaji -«./)J *gn((i, )i<r,] ; (12) 

= { 7 ,(C,fl,)''C l fl l }sgn(r,<r l | ; (13) 

where a,/, Bu, 7, are positive constants, and ( • )/ represents the fth element of the indicated vector. Let 

A, - max f| E(x) || , (14) 

and assuming that the only locally available information is A,, we can pick 

4, =A .(C.S.r 1 || c, II sgn[<r,| , (15) 

On the other hand, if the interaction effects are split into two portions; namely an unknown (but with known bound) portion, 
and a measurable portion, as given below: 

E,(r) = E\ u (x) + E tm (x) , (16) 


292 



the concept of interaction tomptnmlion can be utilised to directly negate the effect* of the meaanred portion. Furthermore, 
if for specific applications, the measurable Interactions an to assume desired nines, or follow prespecified trajectories in 
time, they can be included easily into the model -reference framework above. 

S. Robotic System Configuration and Modeling 

The system under consideration consists of two different robotic arms, each a planar three-link manipulator (Figure 2). The 
parameters of each link are shown in the figure where 



Ana 41 Ana # 2 


Figure 2: Two three-link, planar manipulators. 


L t t = Length of each link i = 1,2; j ~ 1,2,3 

t %) = Location of center of gravity with respect to the end of the previous link 

rriij = Mass of i>-th link 

J t j = Moment of inertia about the corresponding center of gravity 
$ij = Angular position measured counterclockwise 
r,j = Torque actuating the tj-th joint 
d = Distance separating both arms (on the base) 

The overall task can be divided into three phases; approaching phase, grasping phase, and lifting (coordination) phase. In 
the approaching phase each arm moves toward the object to be picked. Speed and position control are applied according to 
the characteristics of each arm. The grasping phase design, although not considered in our study, depends on the sensory 
system assumed to be available. Force sensing can be utilised to implement a controller using force feedback. The last phase 
is the lifting phase during which the two-arm robotic system forms a closed-chain mechanical manipulator. Again, tactile 
feedback can be applied at this phase and we will discuss incorporation of such interaction information below. Lagrange** 
method has been used in finding the dynamical equations of this robotic system, where the equations are written in terms of 
the total kinetic energy ( K) of the system, the total potential energy (P) of the system, and a set of independent coordinates 
(fi) chosen to describe the configuration of the system. Furthermore, these equations may include dissipation functions for 
non-conservative systems. We will not present these equations here but just briefly analyse the conditions while the two arms 
are in contact. For the first arm, let 

a (0u 0i2 0 l$ Y 

T x m (T xl T l2 TxzY , (17) 
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.It can then be shown that, in the approaching phase the dynamical equations of the first am can be written as 
M\i\ + Fi{0 % , ^i)^J + 0i$% ■ T\ , 
where =* 

On the other hand, during the grasping and holding phases, a closed chain robot is formed through the continuous contact 
of the end effectors of both arms with the object. This constraint defines a frictionless manifold which can be expressed in 
dosed form. It is assumed that an additional torque (to be denoted as rj) can be defined to maintain the Up of the end 
effector on the manifold. The dynamical equations of the closed chain may then be written as 

A/jfii + F\(fi, fii)^f + 0 \$\ * + f| . < l9 > 

To find the equation for r u consider a vertical displacement (fis). The corresponding work done by n is sero; that is, 

6W » t x S* 

= + #7„«v . (20) 

where JFn is a generalised force due to the contact. The above equation can then be expressed in terms of the joint angles 
since 


z = L \\ sin fin 4- In sinfin + Inrinfin 

y * -£u cos - lucosdi] - £||Cosd|| 

bx — In cos 9\\S9\\ 4* L\% cos ^ In cos fin^^ts 

Sy — In sin In sin 4* lu sin . 


Substituting the above into (20) results in 


(FJJ| cos fin 4- F7 4 sin fin) In 


' nt * 

(F74 co#9 u + jFTIi sin £-12 

= 

n* 

. (FJ^ cos fin 4- FJ4 sin fin) In , 


. r u . 


(21) 


Furthermore, since Fj* 4 and Ff A are along the direction of motion and normal to it, they are related by *7. a w here 
fA < 1 is the coefficient of friction. 

Following the same procedure used for deriving the equation of the first arm, one can easily find the dynamical equation* 
of the second arm. Furthermore, a similar equation to (21) can be found for the second arm to satisfy the coordination 
movement constraint. The decentralized model -reference adaptive controller is now utilised to control the robotic system* 
To this end, each link is considered an independent subsystem with coupling forces and/or torques being the interactions. 
Thus, 

5| = Link #1 of the first arm with X\ = (xn Xn)* = (fin fin)' 

5s == Link #2 of the first arm with X t = (z 3l Xn) 1 = (fin fin)* 

5) = Link #3 of the first arm with X * = (x ai xjj)* = (fi is fin) 1 

5« = Link #1 of the second arm with X* = (x«| x^) 1 3 (fi n fiji) 1 

5s = Link #2 of the second arm with X t ~ (x, t Zu) 1 = (fijj fijj)* 

5# = Link #3 of the second arm with X 4 - {z 4l x«)' * (fi« fits) 1 • 


Each subsystem of the above has the following general form: 


X 


= A,iX t j)X, + B i v i + £A i 9 ix,t)x, 


#•» 

»** 


( 22 ) 


where U, = T,. Detailed derivations of these equations in the above form may be found elsewhere [16). One can note that 
any measured torque or force betwean the links can be incorporated into the control structure discussei previously. 
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6* Controller Design 


The use of Variable Structure ControUen for robotic •/stem control was introduced by Young (8j. Among more recent work 
in the area ooe can cite those of Morgan and dtguner [9| where deceit Wised controllers were employed, of Slotine and Sastry 
(11} who dwell on the reduction of chattering, and of Young (14) who introduces the design of variable structure model- 
following control systems. The present work differs from the above in that it uses a Decentralised, Model Reference Adaptive 
approach and specifically addresses the multiple manipulator control problem. The controller devised for this robotic system 
is organised in a hierarchical framework. The levels of the hierarchy are divided into two, and the information processed 
at each level is not directly available to the other level. Figure 3 shows the levels and information flow of the hierarchy, 
indicating that there are two paths of information flow. Downward moving data presents the flow of commands while upward 
moving data presents the flow of feedback information. As shown in Figure 4, three tasks are defined at upper level: planning 
the motion of the end effectors of both arms, defining the local reference inputs for each link, and finding the upper bound 
of the dynamical interactions between subsystems. Figure 4 schematically depicts the operations done at this level. 

The end effectors of both arms are required to move in the work-space in a specific way. A path is a continuous curve in the 
system workspace connecting the tip initial configuration to the final configuration through all intermediate configurations. 
On the other band, a trajectory is a continuous curve in the state space of every link joining the initial state to the final state. 
In other words, the trajectory contains all the information about the time history of position, velocity, and acceleration for 
each link. Therefore, a trajectory include., not only a path but also velocity and acceleration at every point at the path. 

The first step in generating the two arm robot motion is to characterise the path in some manner, typically by applying 
physical intuition to some extent. Esentially, the arm should start and stop slowly with a smooth moiion. A number of 
different trajectories may be proposed to satisfy the requirements, such as exponential and polynomial trajectories. 

In the present study, the following equations were used 

*(*) = -«(</) ]e- <M4 - ur (23) 

v(*) = y(h) + [*(*•) - v(f/)l«‘* <, ’ < * r , (24) 


Level I 


Level II 


S 



Figure 3: Control Hierarchy 
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Figure 4: High Level Controller 


where a,6,m, and n are real numbers. In finding the reference input r; for ever/ link of each arm, the inverse dynamics 
approach is utilised. Note that, in the coordination (actual lifting) phase, both end effectors more on the same path. During 
this phase, if a desired contact force profile is required, this can also be cast in the framework of reference generation in the 
given formulation. 

The low lerel controller consists of indiridual controllers for each link. Each controller is designed following the model 
reference adaptive, variable structure system approach. In this level, each subsystem is controlled separately to follow the 
reference model. This is to be done using only local information such as position and speed information of both the subsystem 
and the corresponding local reference model. Furthermore, the local controller required to force the local states to follow 
the states of the corresponding reference model is designed' using the VSS approach presented earlier. Further details and 
simulation studies of cases without tactile feedback may be found in Ref. [16] . 

7. Conclusion 

In this presentation we have briefly described ongoing research in control, sensing and real-time software to produce a two- arm 
robotic system that can accomplish generic assembly tasks. We ha ye concentrated mostly on the control hierarchy, the specific 
control approach selected being the Variable Structure (Sliding Mode) Control approach. The decentralised implementation 
of model-reference adaptive control using Variable Structure controllers was shown to be particularly suitable for such an 
application and the incorporation of tactile feedback was possible. Research is presently continuing on adjustments in the 
feedback gains when a desired torque/force profile is given for end-effectors in contact with each other or other external 
surfaces. 
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a/ o' y 3 

Geometric Foundations of the Theory of Feedback Equivalence 

R. Hermann* 

NASA Ames Research Center 
Moffett Field, CA 94035 

X. INTRODUCTION 

For the past ten years, a group of resoarchers-*mathematicians and theoretical engineers, 
centered at, and partially supported by, the Flight Control group at NASA- AMES— have attempted 
to push beyond what was done in the 1960's for linear control theory, and develop effective 
methods for controlling systems whose dynamical equations are fundamentally nonlinear. Our 
applied focus has been the practical problems encountered in designing aircraft and helicop- 
ters, but our methodology— based as it is on fundamental mathematical principles— is adapt- 
able to robotic systems. 

Conversely, we hope that use of the new ideas under development in the computer science 
and AI community will help us use computer technology in a more effective way to handle types 
of control problems — particularly of a "discrete event" nature — that have been difficult to 
include in a differential-equations based methodology. 

Taking a historical view of progress in engineering and engineering-related mathematics , 
the situation becomes clarified. The breakthroughs of the 1960' s in control theory were 
closely linked to the development of computers, which could solve differential equations very 
efficiently. Mathematically, assumptions on linearity worked well because of the nature of 
the engineering problems that needed to be solved, especially in the Apollo Program, where 
the space craft could be treated satisfactorily as point particles, or at worst as rigid 
bodies. In the 1970's we attempted to adapt the mathematical techniques developed in the 
1960 's to the more difficult problems of control of aircraft and helicopters in circumstances 
where the assumptions of linearity of the dynamics could no longer be realistically justified. 
Recently, there has been a change in computer technology — such as LISP logic-based symbolic 
computation and greatly increased potentialities for parallelism — that has not yet been fully 
integrated into the main body of control theory. Further, computer science has achieved 
greater maturity and substance, and I believe that there are great scientific and technological 
possibilities in combining the talents and insights in the two communities. What control 
theory has to offer is a mature, mathematically based overview of a certain class of engineer- 
ing problems, based on concepts of differential equations and dynamics, while the youthful 
vigor of the computer science discipline is generating a lot of energy, but exhibiting the 
need (in my opinion, at least) for more scientific and mathematical direction. 

For the past two years, I have been trying — with George Meyer's advice and support on the 
engineering questions— to push in two directions. First, to understand how the control 
techniques of feedback linearization — developed as a useful control algorithm by Hunt, Meyer 
and Su at NASA-AMES [1,2] — can be integrated into the mainstream of differential geometry and 
extended in the direction of understanding the relation between global and local feedback 
linearization. Second, I have tried to familiarize myself with the LISP and logic-based 
computer technology and algorithms, and help in the job of introducing it into control theory. 
Since the first part of this program is further along — a major mathematical paper is now 
completed [3] and awaits publication — I will describe some of the ideas it contains here, and 
leave my ideas about developing relations between computer science and control theory to 
anotner occasion. 

2. A VIEW OF FEEDBACK CONTROL IN THE CONTEXT OF DIFFERENTIAL EQUATIONS, DIFFERENTIAL GEOMETRY, 
AND LIE THEORY 

A feedback control system can be taken as an underdetermined system of ordinary differen- 
tial equations of the following general form: 

f ( x# if ' u ) * 0 (2.1) 

x E R n ; u € R m 
f is a map: R^ n+m -♦ rP 

53 Jordan Road, Brookline, MA 02146 ^ 

*Tnis work was begun while the author was a National Reseu~ Cojncil Senior Research 
Associate at the Ames Research Cente%, and continued under grant INAG2406. 
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i* * vector of r” describing component* of the system (aircraft, helicopter, spacecraft, 
robot,. ••) that are fixed in value, such as velocities, positions, angular or linear momenta, 
etc. -u- are the control variables, which we must choose in some way to achieve a prescribed 
goal. 


Feedback c control can be described as follows. A feedback map ,or law is a map 

x ♦ F(x) - u (2.2) 

0 R 10 

from an open subset 0 of R n to the control space R m . A trajectory of the feedback 
control law (2.2) is a curve 

t - x ( t) (2.3) 

in R n that satisfies the following ordinary differential equations 

f(x<t>, §f, P(x(t))| - 0 (2.4) 

In engineering practice, we will want to choose the feedback law (2.2) , so that the 
family of trajectories defined by (2.3) and (2.4) will have certain stability, robustness, and 
design properties. (For example, for the latter one might want the trajectory (2.3) to start 
off at time t ■ 0 at a point xq and end up exactly or approximately at a point x A at t-t^.) 
Stabilisation is the property that is best understood mathematically, hence I will use it as 
a touchstone here. 

Much of the work in control theory of the 1960's — which was very successful on both the 
mathematical and practical fronts — was oriented toward linear control systems, i.e., those of 
the forms 


3 ^ - Ax - Bu - 0 (2.5) 

where A and B are constant matrices of appropriate size. Here, it is natural to require 
that the feedback (2.1) preserve this linearity. This can be accomplished by specifying that 
the feedback map ( 2 . 2 ) be of the following form: 

u - Kx ( 2 . 6 ) 

where K is an m*n real matrix. The trajectory equations (2.2) are then of the following 
form: 


|| - (A + BK) (2.7) 

One may then require that these trajectories have a prescribed degree of stability. Because 
(2.7) is a system of differential equations that can be handled with well-Known mathematical 
teenniques, we know that this behavior can be specified by imposing conditions on the eigen- 
values of the n * n matrix 

A+BK (2.8) 

In turn, this "pole-placement" problem can be handled with well-known mathematical techniques 
(matrix Riccati equations or Kronecker pencil theory) that were applied in the 1960's, but that 
of course go back many years in the mathematical literature. 

It is especially interesting that the useful sufficient conditions for stabilization of 
( 2 . 5 ) via linear feedback ( 2 . 6 ), i.e., "pole-placement" in the engineering jargon, involve 
controllability of the control system ( 2 . 5 ) and is a mathematical concept that is — as I showed 
many years ago (4] — essentially differential- geometric and Lie-theoretic in nature. Thus, it 
should be no surprise that the problem of stabilization and feedback control of a more general 
nonlinear system of type (2.1) also involves differential geometry and Lie theory. 

Indeed, the work of Hunt, Meyer, and Su [1,2) (preceeded by work of Krener [5], Brockett 
[ 6 ], Sommer [7), Jakubzyk and Respondek ( 8 )) demonstrate this in a decisive way. Their work 
only dealt with feedback control of a certain class of systems (the feedback linearizable 
ones, with the functions f( , , ) occurring in ( 2 . 1 ) satisfying certain conditions) if the 
trajectory stayed within a small neighborhood R n , whose size could not be specified in 
advance. This posed the question of finding conditions for global feedback equivalence. 

There has been important partial work on this problem by Boothby, Dayawansa, and Elliot [9-11] 
using the tools of differential topology and foliation theory. In my paper [3] I have begun 
to develop ways of applying the Ehresmann-Haef liger [12] theory of pseudogroup cohomology to 
this problem, but there is a long way to go before the results that are useful in practical 
situations will come forth. 
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The mathematical ha art of the mat ho da I have davaXopad in 13] is tha thaory of vactor 
fiaid systems (or distributions) on a manifold and thair equivalence* X will now sketch soma 
of this basic differential-geometric thaory, than raturn to tha control situation* 

3. VICTOR FIELD SYSTEMS AND FEEDBACK EQUIVALENCE 

X will now usa tha formalism "calculus on manifolds," particularly tha thaory of vactor 
fialds (i.e#, first-ordar linaar partial diffarantial oparators) and tha Jacobi-Lie bracket 
[ . ] (i.e., commutator) of such vactor fialds* Saa Xsidori's book (13) for an engineers 
introduction to thasa concepts* 

Lat 8 be a manifold, with V(8) tha space of vactor fields* In terms of coordinates 
(»*) for 8, 1 < i,i < M ■ dim z7 a V€V(Z) is a differential operator of tha following 

forms ” 

V - A 1 (*) $-r (3.1) 

a * 1 

(summation convention in force) 


If 


than 


V* 



[V,V] 



- B 


3A^ \ 

7 ?) 


a 

57 


(3.2) 


(3.3) 


Let F (Z) be the ring of C* , real-valued functions on Z. V(Z) is a module over 
F (Z) , since vector fields can be multiplied by functionss 

(f,V) * fA 1 (3.4) 

a* 

Definition , A vector field system W on Z is a subspace of V(Z) satisfying the follow- 
ing condition : 

fV € W for V € W 
V^V 2 € W for V 1 ,V 2 € W 


i.e., W is a submodule of V(Z). 

Let W be such a vector field system. For z € Z, set 

W(z) * (V(z) : V€ W) (3.5) 

W(z) is a linear subspace of the space of tangent vectors at z. Its dimension is called 
the rank of W at z. W is said to be nonsingular if the rank is constant as z ranges 
over Z. In this paper we will assume that all vector field systems considered have constant 
rank, unless specified otherwise. The concept defined next will play a basic role in this 
work . 


Definition . Let W c V be a vector field system. Set 

C(W): (V € Ws (V,WJ c W) (3.6) 

C(W) is called the Cauchy Characteristic system associated with W. 

Theorem 3.1 * C(W) is another vector field system on Z with the following properties: 

C(W) c W (3.7) 

(C(W) ,C(W) ) c C(W) (3.8) 

i.e,, C(W) is Frobenius integrable as a vector field system 

(C(W) ,W] c w (3.9) 


Proof. Follows from (3.6). 
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Definition , A curve t • *(t) in z is called an orbit ourv # of tha vactor field lyitta If 
if *We following condition ia satisfied! 

Thar a la a vactor flald V 


V 



In W auch that t + s(t) la an orbit curve of V, i.e., if 
|| - V(a(t) ) 


( 3 . 10 ) 


or, in coordlnata terms i 


- A(z (t) ) 


( 3 . 11 ) 


In tnia way, a vactor flald ayatam dafinaa a family of curvaa on Z. It ia thia gaonatric 
proparty that la tha key to tha uaafulnaaa of vactor flald ayatama in control theory. Aa we 
have aaan, control ayatama ara alao defined by familiaa of curvaa, namely aolutiona of tha 
control aquation* i 

f ( x(t) * S» u(t) ) ■ ° (3 * i2) 

x e r" , u € rP 

Sett 

Z m R n * R® 


Z ■ (X,U) 

Wa can than define a vactor field syatem W on Z aa tha amalleat submodule of V(Z) whose 
orbit curvaa are solutions of tha control equation (3.12). 

Let X and X 1 be manifolds. Let W and W' be nonaingular vector field systems on 
X and X', respectively. Let 

a: X - X* 

be a dif feomorphism. 


Definition , a is called an tquivalenoe from the vector field system W to the vector field 
system W 1 if the following condition is satisfied: 

a # (W ( x) ) » W'(a(x)) (3.13) 

for all x € X 

i.e., if a maps an orbit curve of W into an orbit curve of W'. Our problem is to describe 
numbers attached to vector field systems that are invariant under equivalence. We shall cite 
(without proofs) some of the theorems from (3) that do provide such invariants. 

Theorem 3. 2 . Let a: X X v be a dif feomorphism from X to X* that is an equivalence of 
vector field system W to vector field system W*. Let C(W) and C(W*) be the Cauchy 
characteristic systems of W and W*, respectively. Then, “the following condition is 
satisfied: 

a* (C (W) ) - C(W*) (3.14) 

i.e., a is an equivalence between the Cauchy characteristic systems of the given vector field 
systems. 

Definition . For the vector field system W, set: 

W 1 - W + (W,W] (3.15) 

It is called a derived aystem of W. 

Theorem 3.3 . Let a be an isomorphism from W to W*. Then, it is an isomorphism of the 
derived system w* to w' 1 . 
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We can now iterate. Set 


(W 1 ) 1 ■ 


(3.16) 


to define the euooeeeive derived eyeteme of the given vector field syetem W# denoted ae 
*r#W #••• we obtain an increasing filtration of submodules of the module of all vector fields 


on Xt 


W c W 1 c W 2 c ... 


(3.17) 


Theorem 3.4. We havei 


C(W) e C(W X ) c C(W 2 ) c ... 


(3.18) 


In words# this says that the Cauchy characteristics of the derived systems also form an 
ascending# filtered sequence of submodules of the module of all vector fields on X. 

We assume that all the modules (3.17) and (3.18) are of constant rank, set 

r ■ rank W 


c - rank C(W) 
r, - rank w 1 


(3*19) 


rank C(W‘) 


and so on. 


Theorem 3.S . The sequence of integers 


r i r l £ r 2 i * * ' 


( 2 . 20 ) 


c 1 c ! £ c 2 i 

attached to the vector field system W are numerical equivalence invariant*. 

Let us now apply these results to control systems in state space form. 

4. FEEDBACK INVARIANTS FOR CONTROL SYSTEMS IN STATE SPACE FORM 

Let us now specialize the feedback control system to consider those of the following etate 
apaoe form: 

- f(x # u) (4.1) 

y e R n , u € R ra . 

Theorem 4.1 . Let 

Z - R n *R m 

- { ( x , u) : x € R n # u € R m ) 

Let W be the vector field system Z generated by the components of the following vector- 
value3 vector fields on Z : 

s - ! f(x ’ u) k'h\ (4 - 2 > 

Then# the orbit curves on W are precisely the curves t *•> (x(t)#u(t)) that satisfy the 
control equation (4.1) , . 

Theorem 4,2 . Let dx/dt * f(x#u)# and dz/dt » h(z#v) be two feedback control systems with 
tne same number of states and controls. Let 


! f (x,u) 


i_ JL I 

3x 3u | 


and 
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W' - jh(y,v) , I 7 J 

b% the vector field systems assigned to these control systems. Let 

Ti R n+m R n+m (4.3) 


be a C* 


with 


map of the following form: 
T(x,u) «► <y,v) 


y - a(x) 
v - 0 (x , u) 


(4.4) 


(4.5) 


Then T maps the control system {dx/dt * f(x,u)) into the control system 
(dy/dt ■ n(y,v)}, in the sense that it maps solution curves of the first system of ordinary 
differential equations into solution curves of the second, if and only if T is an equiva- 
lence from the vector field system W to the vector field system W'. In particular, the 
integers r , ri , • • . ; c,c^ , . . • assigned by (3.3) to W are invariant~under feedback equivalence. 
In the case of a linear control system, these integers can be computed in terms of the 
aontrollabi l i ty indioee. 

Let us now consider the vector field systems associated wi:h a linear, scalar input, 
control nystem, i.e., one of the following form: 


dx 

It 


Ax + bu 


(4.6) 


x € R n , u € R, b € R n 

Associate with that system the following pairs of vector fields on R n : 

» ■ “k 


(4.7) 


- b i- 
3x 


Let W be the vector field systems on R spanned by these two vector fields and 3/3u. 
Set: 


V A =» Ad (V) ( V Q ) 


Iv ' v i-i> 


(4.8) 


for i > 0 


Theorem 4.3 . The following commutation relations hold among these vector fields on R : 


[V,V i l > V i+i , for i - 0,1,2,... 


(4.9) 


(v i ,v j ] = 0 , 


for i , j * 0,1... 


The derived system is the vector field system generated by 

I tu ' V#V i ; 1 * if * * m 'i ! 


(4.10) 


Theorem 4.4 . If the system (4.6) is controllable, then 




I Tu 


V V 1 



(4.11) 


As I show in [3], Theorem 4.4 
Hunt, Meyer, and Su [1,2] provided 


is the geometric heart of the sufficient conditions that 
in their work on feedback linearization, namely: 
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Theorem 4.5 . Let be vector fields on R n generating a single input, controllable 

control system of the rol lowing form 

- V(x) + UV Q (X) (4.12) 

Suppose the following condition is satisfied: 

The vector field systems 

{ V V l“ v j * (4 - 13) 

are Frobenius integrable for all j . 

Then, the system (4.12) is locally feedback equivalent to a chosen system. 

In the Hunt-Meyer-Su work, the transformation T, which establishes the feedback 
equivalence of (4.12) with a linear system, is obtained as a solution of a system of first 
order, partial differential equations, and we can only prove existence of such linearizing 
transformations locally . A basic question is: 

How to piece together such local feedback equivalences 
to find a global one? 

The answer can be described in terras of cohomology theory 112) . Indeed, this is a typical 
problem of global differential geometry: 

Find conditions for the existence (and computational 
feasibility!) of a global solution of a system of 
partial differential equations when the conditions for 
existence of local solutions are satisfied. 

What complicates the analysis of the conditions for existence of a global solution is 
that the cohomology theory one must use involves an algebraic object — the groupoid of feedback 
automorphisms of the linear control systems — that is infinite dimensional , so that standard 
topological techniques are not very helpful. It is interesting to note that elementary 
particle physicists at the frontiers — in the so-called string theory — are involved with 
mathematical monstrocities that are very similar to these! Work on this question is in 
progress. 
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Reducing Model Uncertainty Effects in Flexible Manipulators 
Through the Addition of Passive Damping 


T.E. Alberts 
Old Dominion University 
Norfolk, VA 23508 


1. Abstract 

An Important Issue In the control of practical systems Is the effect of 
model uncertainty on closed loop performance. This Is of particular concern 
when flexible structures are to be controlled, due to the fact that states 
associated with higher frequency vibration modes are truncated In order to 
make the control problem tractable. In — thi s pape r emfr fr ey digital 

simulations of a single-link manipulator system ^ to demonstrate that passive 

damping added to the flexible member reduces adveF'jr^rfectS JSSdc'IA'Teb witn / 

model uncertainty. A controller was designed based on a model Including only 

one flexible mode. This controller was applied to larger order systems to 

evaluate the effects of modal truncation. Simulations using an LQR design 

assuming full state feedback Illustrate the effect of control spillover. 

Simulations of a system using output feedback Illustrate the destabilizing 
effect of observation spillover. The simulations reveal that the system with 
passive damping Is less susceptible to these effects than the untreated case. 



/c 


2. Introduction 


Many In-space robotic operations will require arms capable of very long reach, while like other space 
structures, they must be lightweight. Because such arms are likely to be highly compliant (as Is the space 
shuttle RMS arm), control strategies designed to accommodate structural flexibility must be considered. 
Controlling flexible structures through purely active measures can be cumbersome In terms of hardware and 
computation time requirements. Moreover, active controllers for flexible structures are subject to 
Instability and other problems associated with model uncertainty. The burden of active control can be 
reduced by augmenting active control with passive damping. This enhances system stability and reduces the 
adverse affects of model uncertainty, thereby providing justification for the use of low order dynamic 
models and controllers. 

In this paper we consider a single-link, single-axis arm which rotates in the horizontal plane about a 
pinned hub In response to a control torque t(t) . The system, Illustrated In Figure 1, and the models 

employed In this Investigation are based upon a laboratory version of the arm that has been used In 
experimental Investigations [1-4] at Georgia Tech. The flexible member Is a long slender beam that is 
assumed Infinitely stiff In vertical bending but flexible in horizontal bending. 
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Tht pinned hub has rotiry Inertia J. A point payload mass m Is fixed to the bean's tip. Although, 
manipulators sonatinas carry payloads that have significant rotary Inertia, the effect of this Inertia If 
qualitatively similar to that of a point nass for the configuration considered, and hence payload Inertia If 
not Included here. Friction In the pinned Joint Is represented by a rotary viscous dashpot. TMi 

configuration Is viewed as being representative of lightweight, large payload capacity Manipulators, 
Parameters and dimensions for the am that the system considered In this paper are tabulated In Appendix A. 

Damping augmentation Is provided by a constrained viscoelastic layer damping treatment [2,5]. ta 
approach Involves bonding a thin film of viscoelastic material to the flexible member's surface. TMi 
viscoelastic layer In turn has a stiff elastic constraining layer bonded to Its surface. The combine* 
system forms a sandwich-1 ike structure Illustrated In Figure 2. When elastic deflection of the structure 
occurs, shear Induced plastic deformation Is Imposed In the viscoelastic layer. The energy dlsslpatlo* 

associated with the plastic deformation provides the desired mechanical damping. The damping ratio for th* 
untreated beam was approximately constant for all modes at .007. The treatment Increased the damping ratio! 
associated with the modes of Interest (say the first six modes) by about an order of magnitude. The treate* 
beam had a damping ratio of .03 for the first mode and the values for the 2nd through 6th modes ranged fro* 
.052 to .06. Additional damping Is Introduced by joint friction. 


HeetU Cenatr 



Layer 


Figure 2. Treated Beam Clement Under Flexure 


The first step of controller design Is usually the development of a "design model" that Is a slmpllfle 
representation of the actual plant dynamics. The design model serves as the basis for controller design 
In the case of flexible mechanical systems, the design model Is often a truncated representation of th 
actual plant, retaining only a few critical modes. This implies the assumption that a model based upon 
small number of vibration modes provides adequate representation of the much larger order actual plant, fo 
controller design purposes. The modeling error associated with the neglected modes, adversely affect 
closed loop system performance. In this paper, simulation results are presented to illustrate that the 11 
effects associated with modeling error are reduced somewhat through the addition of passive damping to th 
system. 

We consider a multivariable control system, designed according to the steady state linear quadratl 
regulator (LQR) approach. A four state model Including only one flexible mode and the rigid body «od 
represents the design model. We consider the consequences of controlling larger order plants 

representa tlve of the actual system, with a controller derived for the design model. 

The regulator Is formulated to penalize tip position and control effort. Two cases are considered 
The first assumes that full state feedback Is available. The second case uses output feedback of tl 
position (v L ), tip velocity, hub angle (0) and hub angular velocity. The controller designs are kept slnpl 
to facilitate comparisons between the damped and undamped systems. 1 


3* Dynamic Model 

Linear transfer function models for the system of interest were developed based on the assumption ( 
small bending deflections and small hub angles. Transfer function modeling for similar systems has be« 

discussed by several authors [2, 4, 6-8] and we will not repeat the procedure here. Details on development * 
the model employed here may be found In [2], The transfer function poles and zeros used in thl 
Investigation are tabulated In Appendix B. 


1/ Although It possesses some light structural damping and is affected by joint friction, we shall 
designate the untreated arm as "undamped". 
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In the Inurtit of obtaining « tut* ipici realization fro* t h* transftr functions It It convenient to 
work with the partial fraction expansion f ont giv en by Equation 1. Tha w*t art tha nodal frequencies* 

the w*s are tha damped nodal frtquanclts w ✓ 1 - » the C*s art nodal da up In? ratios and n It tha nunbar 

of flexible nodat represented. Damping di i* to Joint friction hat not baan accountod for In that* transfer 
functions but will Introduced later at a fora of feedback* 2 Tha residues k Q and u 0 correspond to the rigid 
body node* 


lc 

s 


cjs) • JL ♦ ♦ — 

% 


M ♦ u, 


» 4 ♦ y . 


7 4 ... 4 - 


V 4 % 


% ♦ 


(i) 


The subscript x on (s) raprasanu the output variable of Interest. For tha present study four transftr 
functions ware required. Equation 2 summarizes these and defines the noutlon used hare. 


hub angular position' 


0<») ‘ 



hub angular rate 

• 

*0(») 

m 

Gj(s) 

beam tip position 


v L (*) 


G vl<*> 

beam tip rate 




. G w. (,) . 


Here s Is tha Laplace operator and T(s), O(s) and ( s ) denote the Laplace transforms of the Input torque* 
hub angle and tip position variables* respectively. 

Figure 3 Is a block diagram representation of the transfer function. 



Figure 3. Block Diagram Equivalent of Equation 1 
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THt tystMi Mtrlcts corresponding to Figure 3 «r« as follows: 


1 

C 

\ o 

** 2 - 2 %<« 

I • [ 0 1 0 1 ••• 0 1 } T (3) 

Si “ C M o x o “I S •" *S» N . 1 

Tho loading principle 2x2 submatrlx of A represents the rigid body node In the tip and hub position transfer 
functions. 

For each of the output variables x there Is a unique output matrix . He will denote thase as 

C e , Cj, C yL and C* L with the noutlon carrying the obvious Meaning. These are assembled to forts a 
Measurement matrix representing four outputs as Indicated In the measurement equation (4) given by: 

* 

C 9 

C i 

, * *C* («) 

l 'yt 

C \ 

In order to account for viscous joint damping we consider the feedback system Illustrated In Figure 4. 





Figure 4. Block Diagram Illustrating Feedback of Joint Damping 


When the effect of joint damping Is Introduced through boundary condition C eedback. a new A matrix Is 
formed: 

A « A - BCjb (5) 


Where b Is the joint damping coefficient. The analysis that follows Is based upon a system model of the 
form (A, B, Q) . 


4. System Representation 

We wish to design a controller for a plant with 2(n+l) states, using a design model Including only one 
vibration mode. Here n represents the number of flexible vibration modes required to provide accurate 
representation of the actual plant, and the additional two states represent rigid body motion. Controllers 
developed for the single mode system are applied to a model Including three vibration modes (8 states), and 
one Including six vibration modes (14 states). These are referred to as the plant models In the text that 
follows because they are Intended to represent actual plants In the simulations presented here. 
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Tta tyttM tQuatlMt for tho flout aodolo art |1*on by: 

i„(t) • ^ (t) ♦ l„« (t) «) 

* (t » * So % (t » 

Here ^ It tut J metrlx defined In Equation 5, la tMt cm for a* a mode approximation of the plant. 
Similarly tl« ^ aatrli Is the ( matrix of Conation 4 for an n mode approx 1 notion. 

THa system equations for tHa design noOals art given by: 

^(t) • ^ «j(t) ♦ IjiKt) (7) 

y(t) • fij ajCt) 

The system output vector (y) considered It of the fom 

y • [ e S v^ v^ ] T . (S) 


S, noftlatar Design 

The standard fort of 1 1noar quadratic cost function Is 


J c ' f (,T °* * " Tr * ) dt 


(») 


where 0 Is a symmetric, posltlvo seml-deflnl te stato weighting natrlx and R Is a symmetric, position 
doflnltt control tffort weighting natrlx. Bocauso no sock to regulate tip position, a porfomanco Index 
that penalizes tho tip position output variable and control effort was chosen. A cost function for tip 
position output weighting Is expressed as follows: 


J 


c 



♦ rt*| dt 


( 10 ) 


Tho output weighted performance Index (10) Is equivalent to the standard stato weighted version (9) with 
weighting matrices given by: 


0 




s« 


R - tr] 


(ID 


The system (A ft , B n , Q n ) represents an actual plant with dynamics that are either incompletely known or too 
cumbersome to permit the use of the full model In controller design. The four stato design 
model (A|, Bj, £j) will serve as an approximation to the actual plant for controller design purposes. In 
this case (j replaces £ n In the state weighting matrix Q (11) 

Two attractive features of the rip position weighted cost function (10) are that It has only one 
parameter (r) to vary, and that a given value of r can be expected to Impose similar performance demands on 
both systems (damped and undamped). Reducing the value of r decreases the penalty on control effort and Is 
therefore equivalent to demanding higher performance at the expense of Increased control energy. 


f. State Feedback 

In a system with decoupled modes, such as the Jordan canonical realization of Equation 3, a state 
feedback law 


u • (12) 

designed to stabilize the reduced order system (7) will stabilize the actual system (6), provided that the 
truncated modes are asymptotical ly stable. The neglected modes can, however, be excited at their natural 
frequencies In response to the applied control Input. This effect Is called control spillover [9,10] In the 
literature related to controlling flexible spacecraft. The system we are considering has a small amount of 
modal coupling, due to the Introduction of viscous joint damping using Equation 5. Since wt normally do not 
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expect viscous damping to destabilize • tystts, It Is reasonable to expect that the state feedback lav (12) 
will stabilize tf» plant models of Intorost. 

In tbls soctlon we design a control ltr based on tba output weighted performance Index (10). We at sum 
that tha first four elements of tha stata vac tor ara somehow available for feedback. Control Is applied to 
these four suta elements according to Equation 12. Tha time varying gain Kj(t) that minimizes the cast 
function (f) is given by 

KjU) - «" ! l 1 T S(t) (13) 

wham S(t) Is tha solution of tha associated matrix Klccatl equation. An often used substitution for tha 
opt leal gain Ki(t) Is Its consunt steady suta value K. • K .(•) . The steady suu value ~f SC t) Is the 
solution of tha algabralc Rlccattl aquation: 

-$(-) - $(-)*, ♦ Aj T S(-) - $(-) •,«' l 8 1 T S(.) *0-0 (14) 


Tha steady stau gain solution is used tha simulations presented hem. 


7. Simulations Kith State Feedback 

Figure S Illustrates tha simulated msponsa of tha design modal (7) to a 4.8 Inch step commend, for 

various values of r. The simulations Indicate that both tha damped and undamped systems are capable of 
almost arbitrarily good performance as r Is decreased. In practice, the limit on the msponsa tine Is 
dictated by the strength of tha beam and tha torque limit of the actuator. A theoretical (assuming fnimite 
beam strength and motor torque capacity) limit on the speed of msponsa Is discussed by Schmitz [6]. This 
limit is related to tha non-minis** phase character of the tip position transfer function. Notice mat the 
wrong way start phenomenon typical to systems with non-minimum zeros Is Indicated In the plots. Schmitz 
Interprets tha theoretical response limit as being roughly equivalent to a pure delay associated with the 
Initial period during which the tip moves In the direction opooslte to the control command. 

When the state feedback law (12) Is applied to the larger order systems (Figures 6 end 7), the 

excitation of the second mode of vibration Is readily apparent when r • 1<T*. In the undamped system 
(Figures 6a and 7a) the oscillation takes mom than two seconds to die out. Thus. In the case of the 

undamped system, we find that designing for higher performance (by reducing r) actually results In slower 
response. The excitation of the second mode also occurs In the damped system (Figure 6b and 7b). however, 
It dies out In about 0.8 seconds. Although the performance of the actual plant Is notes good as tmt of 
the design model (Figure 5b), the simulation Indicates that the response time for r* 10“* Is slightly better 
than the lower levels of demanded performnee (larger values of r) considered. This Is In sharp contrast 
with the results of Figures 6a and 7a for the undamped system. This example clearly Indicates that the 

damped system Is less susceptible to control spillover than the undamped case. 

It should be noted that the peak control torque command, when r • 10“*, Is about 4800 Inch pounds. 
This value Is well above the beam's maximum bending moment capacity (~ 175 In. Ibf. based on yield) end Is 
about 60 times greater than the rated torque capacity (85 In.lbf.) of the experimental system's motor. In 
light of these figures, one might argue that control spillover Is not a realistic concern for the system of 
Interest. The author concedes to the somewhat artlclclal nature of this example, however, further 

consideration of the results adds to their significance. Suppose the Initial step command Is scaled down by 
a factor of 50 to about 0.1 Inches. Because the system model Is linear, we know that the corresponding peak 

torque Is about 100 Inch.lbf. This Is a realistic figure for the system of interest. In Figures 6a and 7a, 

peak tip position oscillation amplitude Is about 1.5 Inches. Scaling this figure down by a factor of 50 
gives 30 thousandths of an Inch - a significant value In the context of robot accuracy. • 


8. Output Feedback 

The simulations presented In the previous section were based on the assumed availability of states. 
Practical control systems must depend upon measured outputs for feedback. Frequently the outputs are 
different entitles than the states. In contrast to systems using state feedback, output feedback systems 
are subject to Instability as a consequence of model reduction [9-11] even when the neglected modes are 
asymptotically stable. This effect Is sometimes called observation spillover. 

In this section we follow the steady state LQR controller design approach employed In the previous 
section, however, we Implement the controller using output feedback. The design model (Aj. Bj, Cj) has four 
states and four outputs and the measurement matrix Cj Is Invertible. This allows us to calculate the state 
Xj of the desing model from the output vector y according to: 


x 


1 


- c i 




y 


( 15 ) 
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a) Undaapad Cast 


b) Oaapad Casa 


Flgura 5. Closad Loop Sup Rasponsa Using Stiti Faadback, Ona Vibration Moda Plant Nodal 









In this cist the output feedback control U-.t Is given by: 


« * Htj (16) 

The output feedback law (16) » when applied to the dtslgn modal Is equivalent to stata feedback (12). When 
applied to controlling tha actual model, the output faadback control law (16), expressed In the form of a 
state faadback law Is given by: 

• ■ -*! S, M S„ *„ W) 

Notice that whan appllad to tha plant modal, tha output faadback law receives Information from tha states 
that wart neglected In design. This Is unwanted Input (spillover), and can be viewed as a form of 
measurement corruption. 


9. Simulations With Output Feedback 

Simulation results obtained using output feedback are presented In Figures 8 through 10. Figures 8a 
and 8b are simulations of the design model using the output feedback law (16). When applied to the design 
model the output feedback law considered Is equivalent to state feedback (Figure 5). This case Is preseated 
here as a basis for comparison. When the low order output feedback law (16) Is applied to the actual system 
models (Figure 9 and 10) we observe that the performance Is limited by the onset of Instability. In the 
undamped system, the first vibration mode Is unstable for r ■ 0.01 and r ■ 0.005. The damped system remains 
stable under the same conditions, however, some first mode oscillatory behavior becomes evident as ve 
attempt to design for higher performance. The damped system Is not Immune to tne Instability experienced by 
the undamped case, however, due to Its more favorable open loop pole placement It 1$ more robust. 

Upon comparison of the six mode and three mode systems, we find that the stable responses of the six 
mode plants do not differ noticeably from those of the three mode plants. On the other hand, the divergence 
rate of the unstable oscillations Is greater In the six mode plant (Figure 10a) than In the three mode pUnt 
(Figure 9a). This Indicates that the presence of the higher, neglected modes (4th, 5th and 6th) do affect 
system stability slightly. 

This example Illustrates that the passive damping treatment considered reduces the flexible system's 
susceptibility to observation spillover Induced Instability. The peak torque commanded at the highest 
performance (when the system Is stable) was about 80 In.lbs.. Indicating that the performance demanded ms 
reasonable for the system under consideration. The example employs perhaps the most simplistic of ill 
possible output feedback schemes. Systems employing state estimators also rely on measured data for 
feedback, and they too are subject to Instability due to modeling error. 


10. Conclusion 

One form of modeling error that Is relevant for control of flexible structures results from Ignoring 
high order vibration modes In the process of deriving a design model. The effects of this type of modeling 
error are mnlfested as control and observation spillover. We have presented simulations of multivariable 
control a particular flexible arm to Illustrate that the addition of passive damping yields a system that Is 
less susceptible to these undesirable effects. 

To some degree these results follow Intuition, In that one naturally expects that Increasing the 
damping terms of a system's eigenvalues will provide a more stable system with Improved performance. The 
results presented are Intended to demonstrate the concept of passive damping on an example that Is 
representative of practical lightweight manipulators. 
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Figure 7. Closed Loop Step Response Using Output Feedback, Six Vibration Mode .-Plant Model 
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APPENOIX A - System Parameters and Dimensions 


Joint Inertia 
Payload Mass 

Joint Damping Coefficient 
Beam Dimensions 
Material 


J ■ 30.2 1n 2 .lbm. 
m * 0.09 lbm. 
b « 0.10 In.lbf. s z 
48" x 3/4" x 3/16" 
6065-T6 Aluminum 


APPENDIX 8 - Transfer Function Poles and Zeros 


Table B-l Undamped System 


Mode 

System Poles 

Hub Angle T.F. Zeros 

Tip Position T.F. Zeros 

1 

-0.0541 

*37.726 

-0.0149 

*32.1129 

*8.3410 

2 

-0.1284 

*318.3456 

-0.0985 

*314.0768 

*45.0741 

3 

-0.2957 

*342.2446 

-0.2853 

*340.7557 

*111.3047 

4 

-0.5769 

*382.4188 

-0.5719 

*381.6939 

*206.9715 

5 

-0.9633 

*3137.6207 

-0.9603 

*3137.1836 

*332.0743 

6 

-1.4532 

*3207.5965 

-1.4511 

*3207.2946 

*486.6130 

Table B-2 

Damped System 




Mode 

System Poles 

Hub Angle T.F. Poles 

Tip Position T.F. Poles 







1 

-.22197 

*37.1601 

-.0176 

*31-9443 

1 -7.2785, + 7.8830 

2 

-.9124 

*317.4515 

-.6202 

*313.2581 

2 -45.9556, +44.7292 

3 

-2.3169 

*341.6716 

-2.2342 

*340.1610 

3 -97.7161, +113.9306 

4 

-4.8477 

*383.2589 

-4.7987 

*382.5101 

4 -263.5548, +216.0952 

5 

-8.5293 

*3142.4761 

-8.5042 

*3142.0184 

5 -389.2245, +351.1999 

6 

-11.7674 

*3219.5993 

-11.7581 

*3219.2929 

6 -554.2140, +519.0317 
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1 . 

~ In advanced robot control problems, on-line computation of Inverse Jacobian solution I 

Is frequently required. Parallel processing architecture Is an effective way to reduce j 

computation time, i n t hi s - -paper, j. parallel processing architecture Is developed for the / 

Inverse Jacobian (Inverse differed!*] al kinematic equation) of PUMA arm,£4£. The proposed / 

pipeline/parallel algorithm can be Implemented on IC chip using systolic linear arrays. / 

This Implementation requires 27 processing cells and 25 time units. Computation time Is 

thus significantly reduced. J 

2. Introduction 

In many advanced robot control problems, such as with sensor guided manipulations. It Is essential that the 
end effector be appropriately controlled In Cartesian coordinates so that the robot can adapt to a changing 
environment. This means that we need to compute the Inverse Jacobian In real time to provide the required 
differential change In joint variables for a desired differential change In position and orientation. The speed 
of this computation directly affects the speed of robot operation. Thus efficient algorithms for computing the 
Inverse Jacobian are needed. 


There have been efforts made recently In developing computationally efficient algorthlmc to solve the 
Jacobian problem suitable for serial computer Implementation [1,2]. In addition some work has been reported In 
algorithm development for implementation on pipelined or parallel computer [3]. These results show that such 
parallel algorithms can reduce computation time significantly. 

A more Important requirement in robot manipulation Is the computing of the Inverse Jacobian solution. This 
is generally a troublesome problem when we try to Invert the Jacobian numerically. A more direct approach Is to 
derive an explicit solution of the Inverse Jacobian for a given robot. Paul, Shlmano, and Mayer [2] have shown 
that such solutions can be obtained by differentiating the kinematic equations. This approach has shown to 
result simpler Inverse Jacobian solutions with regard to manipulator degeneracies and joint constraints. The 
inverse Jacobian of the PUMA arm has been solved specifically In [2]. 

In this paper, we present a pi pel 1 ne/paral lei algorithm and architecture for computing the PUMA arm Inverse 

Jacobian derived In [2]. With rapid advances in VLSI technology, this type of algorithm can be readily 

Implemented on IC chips. These special purpose chips can be connected to a host computer system to achieve 

real-time Cartesian space control at sufficiently high sample rate. It Is noted that a study has bet.i made 
recently to Implement direct kinematic solution on VLSI chips to speed up computation time [4]. The goal here 

Is to further exploit the advantages of VLSI technology for the aeslgn of customized chips dedicated to the 

computing of the Inverse Jocoblan of PUMA arm. 

3. Differential Kinematic Solution of PUMA Arm 

Differential changes In joint variables dqi can be related to the different changes in translation and 
rotation dx, dy, dz, fi x , fiy, and fi 2 of the end effector by the relationship 

[d x * dy* dz* fi x , fiy, 67,] * J [dqi, dq2,...»dq n ] ( 1 ) 

In which n Is the number of joints, and J Is the Jacobian matrix. But in advanced robot control problems, we 

need the solution of dq-j given the desired differential change d x , dy, d z , fi x , fiy, fi 2 . That is we need to 

compute the Inverse problem 

[dqi, dq 2 ,...,dq n ] T - j’ 1 [d x , d y , d 2 , fix, fiy, fiz] T (2) 

This represents the Inverse differential kinematic solution (Inverse Jacobian) of the robot arm. 

Instead of relying on the direct computing of the Inverse Jacobian matrix J~* , an analytical solution of 
the Inverse Jacobian problem can be frequently obtained, and such a solution for the PUMA arm is given in [2]. 
For the PUMA arm, the joint variables are the six rotational joint angles e^, ©^....©g. Furthermore, the 
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differential changes In translation and rotation can be related to the differential change of the end effector 
honogeons Matrix T [2]: 


dT - T 


(3) 


where 


dT 


T - 

n o a p 
0 0 0 1 


dn x 

do x 

<u x 

dPx" 

dhy 

do y 

day 

dpy 

dn 2 

do z 

da z 

dpz 

0 

0 

0 
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0 

-«z 

«y 

dx" 


0 

•fix 

dy 

-6y 

«x 

0 

dz 

0 

0 

0 

0 


AT = 


Therefore differential changes In translation and rotation can also be specified' In terms of the x, y, z 
elements of dp, do, and da (dn vector Is redundant). The desired solutions of doi In terms of dp, do, and da 
for the PUMA arm obtained In [2] are given In the appendix. A plpellne/parallel processing architecture fcr 
computing these equations Is now developed below. 

4. Systolic Array Processing 

VLSI technology has created a new architecture horizon In Implementing parallel algorithms directly on 
hardware. Central to this architecture Is the use of systolic linear arrays which consist of Inter- 
connected simple and mostly Identical processing cells. Algorithms that can be executed using Identical 
operations simultaneously can take advantage of the systolic array architecture to reduce computation time. 

The processing cell ftructure we will employ Is the “Inner product step processor* which performs 
matrix-vector multiplication using one-way pipeline algorithms. For example, computing 

Ab * p 

where A Is nxm and b Is mxl , can be carried out In the following recurrence manner: 

pi°) * 0 


( 4 ) 




p(*0+ a k 

*1 a 1k D k 


l,m, 1 ■ l,n 




p(») 


This operation can be Implemented by a linear array of m Inner product step processors shown In Figure l. 

In the following section, we will reformulate the Inverse differential kinematic equation given In the 
appendix In terms of a set of matrix-vector multiplications which can be computed In parallel and pipelining 
fashion. 

5. Algorithm Development 

In this section, we present the matrix-vector multiplication processing schemes for computing the 
dl fferentlals doi, 1 - 1,2,.. .,6. Here we assime that the trigonometric functions required are available. 
Typically these functions can' be generated by employing ROM look-up techniques [5,6]. The algorithm Is broken 
down Into 15 steps as described below. The notation $ S1ne>. * CosOj are used. 

(1) - Pl 

dpy dp x p x py ay a x da x -a x day Oy o x do x doy 

-dp x dpy py -p X -a X 3y d3y “fly -dfly — 0 X Oy dOy -dO X 
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b i ■ l c i s \] p i * ( p ir # " p i9 p no** * p ii3 J 
output: dOj • Pxx/Pl3 

(2) f x mj ♦ g 2 ■ hj 

f l ■ l p 14 p 15 p 18 P 110 ’ p Ull m i ' do i 
®1 * t p 12 P 17 p 14 p 112 P 113 1 
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[ h ll* 

... 

h i 5 l 


(3) 

A 2 b 2 

’ p 2 





4" 

r-.3 

<*4 

Pz 

“PI 3 



L d4 

*3 

P13 

Pz 


P 2 " 

[ p 21* 

... 

p 24 1 


(4) 

f 2 m. 

! * g 2 

- 

h 2 



'I" 

( C 3 

S 3 

P 21 P 23 P 24 1 


9 T 2 * 

(*3 

d 4 

0 0 

°1 


* 

a 2 . 

■4 

■ ( h 21 

• * • • ^2^ | 

(5) 

*3 p 3 

* p 3 





a 3 - 

[* p z 

p n) 

b 3 ‘ l dp 


output: dOj * P3i/ h 23 

(6) A 4 b 4 - p 4 



P5 ' (P 51 P S2 1 



b 5 * t dp z h ll d0 3 1 


(8) Agb6 ■ P6 

a T m p 42 * a z p 16 “ p 51 b 12 da z p lll ~°z h 14 ‘ do z 

6 - p 41 - p 16 “t - p 52 - da t h 12 -°/ * P 1U " d °z ' h 14 


outputs: d023 a P64/P51 d0 ? " d0 23 " d0 3 
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(9) f 3 m 3 

'S' 

«3 ’ 


(10) A ; b ? 


+ g 3 - h 3 

[ p 3 p 63 p 68 * p 67 1 
t p 65 p 66 p 69 P 61ol 

( h 31* * * * h 34 1 
‘ p 7 

P 15 p 63 
p 32 * h 13 


m 3 * d0 23 


p 15 

p 63 


output: do 4 ■ p 7 g/Pn 

(11) A 8 b 8 . p 8 


f p 15 P 31 p 67 

A 8 ■ 

[_* p 63 h 13 P 1 16 

»8 * fC 4 S 4 ] 

(12) f 4 m 4 + g 4 ■ h 4 


p U0 h 33 " p 67 h l5 ' 
* p 67 h 15 * P U0 ’ h 33. 

p 8 * t p 81 * * * * p 87 1 


P 7 ' 


p 7l" 

p 72 


f 3 * t p 81 p 84 p 86 1 
9 4 * t Pg2 p 85 p 87 1 


(13) 

Agbg* 

p 9 




’ h 41 

_h 32 


a 9 • 

- p 68 

p 83 



_* h 42 

- h 34 


output: ' 

dog * 

(14) 

f 5 m 5 

♦ s 5 

* h 5 


f 5 * 

t p 92 1 


(15) 

A 10 b 10 3 p 10 


A 10 3 

' t h 51 

- h 43 


m 4 • do 4 

h 4 " ( h 4 i h 42 b 4 3 1 

p 9l" 
p 9 * P 92 

_ p 93_ 

[P93] m 5 * d0 5 h 5M h 5l] 
b 10 * t c 6 s ol p 10 * t p 10ll 



output: dOg * Pjoi 


The data flow timing table for these computations are given in Tables 1 and 2. It Is shewn that the 
solution requires 25 time units and 27 processing cells. 

The results of 25 time units is a significant reduction of computation time in comparison with that when a 
serial computer Is used to compute the original solution. The total number of multiplications of that solution 
is about 150. This is equivalent to 150 time units In the systole array processing system as opposed to the 25 
% time units we have achieved by exploiting parallelism. 
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Tabl« 1. Oata Mow timing table for steps 1 through 8 which compute doi 863 and d03. Nunbers 
on top row indicate time units. 


Px Py 


Py *Px 


ay -a x 

dv a 


x °y 

da x day 

-«x " a y 

day -da 


iy -a a x 

Oy o x 


O x Oy 

dOx dOy 


dOy -do x 


-33 d4 

d4 a3 

P 7 


Pl3 

-Pl3 P 2 


«2. a 3 
da 


p z * p 13 


p 13 P Z 


h 21 * h 22 “ h 24 

h 22 h 2 i h 25 
P42 P 41 


p 16 -4 Z 

* p 5l ' p 52 

hi 2 -da 2 

da 2 hi 2 


p lll -07 

-o 2 * p Ul 

1*14 -d<> 2 

-do z -hn 
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Tabic 2 . Oata flow timing table for steps 9 through IS which compute dee, d 0 5 . de$ 


10 11 12 13 14 IS 16 17 18 19 20 21 22 23 24 2S 


p 64 p 65 

P63 P66 

P68 p 69 

- p 67 P610 

PlS P63 Pi 5 

h3i -hj3 P63 

p 15 -P63 Ce 

h3i hj3 S 4 

P67 Pi 10 


PllO 'P67 

h 33 *15 

* p 67 "PllO 

h^ -h33 


P 8 i d0 4 • p 82 

p 36 p 85 

p 86 p 87 

*>41 * h 32 C 5 

-pGG P83 S 5 

-h42 -h34 

P 92 d0 5 * P93 

* h 43 C6. S 6 


6, Conclusion 

It has been demonstrated In this paper that parallel computing architecture can be developed for the 
Inverse differential kinematic equation of the PUMA arm. By using systolic linear arrays employed in VLSI chip 
design, the computation can be completed with 27 processing cells in 25 time units. 

The differential kinematic equation in Its original form requires about 150 multiplications to compute. If 
one multiplication Is counted as one time unit, the parallel architecture definitely provides a substantial 
reduction in computation time. A customized 1C chip dedicated to this algorithm can be fabricated. 
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Appendix 


de, 


d0n 


: Inverse Differential Kinematic Equations for PUHA arm 

C,dP y - SjdP x 

■ c i<*v;' 

d i 

*2 (d 4 c 3 " a 3 5 3^ 

dj • f jj ( p)df jj ( p) * Pj 2 (p)df j 2 (p) 
flj(p) ■ CjP,, • 

dfjj(p) ■ -^P x dei * Cjdp x ♦ CjPydej ♦ Sjdp y 
f 12 (p) - -dp z 


de 23 


-S 23 d « 2 - C 23 d ^ 1 
C 23 v 2 * S 23 u l 


V 1 * “ w 2*ll^ * w i p z 
v 2 " u l f ll( p > + 2 

* d 3 ^ 

* il 2 " d 4 * *2^3 

dvj ■ -L^dfjjfp) “ a A 2 f n(p) C 3 d0 3 + u l dp z ' a 2^3 p z de 3 
dv 2 ■ w 1 df 11 (p) - a 2 5 3 r U^ dfl 3 * “2 dP z + a 2 C 3 p 3 de 3 
de 2 * d0 2 3 - ^63 

NC 4 d(NS 4 ) - NS 4 d(NC 4 ) 

d0 A ■ n n 

4 <ns 4 ) z ♦ (nc 4 ) z 

NC 4 * C 23 D 41 - S 23 a z 
°41 * c l a x * s l a y 

d(NS 4 ) • -C 1 a x da 1 - S 1 a y de l ♦ Cjda z - Sjda x 
ns 4 * * s l a x * c l a y 

d(NC 4 ) * -S 2 30 41 d9 2 3 ♦ C 23 dD 41 ' ~ C 23 a z de 23 ' S 23 da z 

dD 41 * * S l a x d9 l + C l da x * C l a y d9 l + S l da z 
dflc 3 C^dS^ - S^dC^ 

dS 5 • -S 4 NC 4 de 4 ♦ C 4 d(NC 4 ) ♦ C 4 d9 4 NS 4 ♦ S 4 d(NS 4 ) 
dC 5 * C 2 3d9 23 D 41 ♦ s 23 d0 41 * S 23 a Z d9 23 + C 23 da Z 

s 5 ■ c 4 nc 4 ♦ s 4 ns 4 

C 5 * S 23 3 41 + C 23 a Z 


3-23 



“ ^6 d *6 " *6*^6 
S 6 " ‘ C 5*61 ' S 5*612 
Cj ■ -S« n 6H * C 4 M 61 12 

dS 6 " S 5*61 d# 5 * C 5 d *61 ' C 5*612 d, S * S 5 dW 612 
dC 6 • -dS 4 N 6n - S 4 «« 6ll ♦ <tf 4 N 6U2 ♦ C 4 dN 6112 

* - C 4*6U d *4 ' s 4 d *6U + * S 4*6112 d# 4 * C 4 d *6112 
*61 * c 4*611 * S 4*6U2 
*611 ‘ C 23*6111 * S 23°z 
*6112 ’ * s l°x * C l°y 

dN gl - -S 4 N 611 de 4 ♦ C 4 dN 6U + C 4 N 6n2 d9 4 ♦ S 4 dN 612 

d *6U * " S 23 N 6111 d9 23 * C 23 d *6111 ' C 23°z de 23 * ^^z 
d *6112 ‘ * C l°x d9 l - S l d0 x * S l°/ 9 1 * c i d0 y 
*6111 * c l°x * S l°y 

d *6111 * *W 9 1 * c l d0 x * C l°y d9 l + s i d0 y 
*612 * * S 23*6111 * C 23°z 

d *612 * " C 23*6111 d9 23 * S 23 d *llll + S 23°z de 23 * C 23 d0 z 


b 


k 


.<*> 



Figure 1. Inner product step processor 
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1. Aba trace. 

Conaenta on th« application to rigid link manipulators of Geometric Control Theory, Resolved Acceleration 
Control, Operational Space Control, and Nonlinear Decoupling Theory are given, and the essential unity of these 
techniques for externally linearising and decoupling end effector dynamics la discussed. Exploiting the fact 
that the mass matrix of a rigid link manipulator is positive definite - a consequence of rigid link 
manipulators belonging to the class of natural physical systems - it is shown that a necessary and sufficient 
condition for a locally externally linearizing and output decoupling feedback law to exist Is that the end 
effector Jacobian matrix be nonsingular. Furthermore, this linearising feedback Is easy to produce. 

2. Introduction. 

Because of the difficulty in controlling rigid l Ink manipulators, along with a primary concern in 
controlling end effector (EF) motions, it is natural to ask if a nonlinear feedback law exists which will make 
an EF behave as if it has linear and decoupled dynamics. It has been known at least since the early 1970s 
[ 1 1 - { ^ 1 that exact linearisation of manipulators in joint space is readily accomplished by the so-called 
Inverse or Computed Torque Technique. Efforts to accomplish decoupted linearisation of EF motions directly in 
task space began soon thereafter as is evident in the work of fbl-fUj. 

The work of [b], although concerned only with controlling the tip location of a three-link aanipulator in 
the plane, Is surprisingly prescient in Us approach in that it proceeds by the three explicit steps of 
1) decoupled linearization of tip behavior; 2) stabilization of the resulting tip dynamics; followed by 
3) trajectory control of the now linearly behaving tip. Such clarity of approach will only be retrieved in the 
latter work of fl9)-(22j« The work [6j also presages future work in its dealing with the problems of 
manipulator redundancy and actuator saturation. 

With hindsight, the work (6] can also be viewed as a direct precursor to the development of the Resolved 
Acceleration Control (RAC) approach to the end effector tracking problem {71(8). RAC essentially extends the 
work of (6] to the case of a full six dot manipulator yielding linearized F.F positional error dynamics and 
almost linearized EF attitude error dynamics (the extent to which attitude error dynamics are "almost’* 
linearized will be discussed below). The work of (71(8), however, did not make clear the three steps of [ 6 1 
and consequently appears to have not been appreciated as a technique for performing decoupled exact 
linearization of EF motions, but rather as a technique for end effector tracking which has (almost) linear 
tracking error dynamics. The fact chat the attitude error dynamics are not completely linearized also 
apparently obscured the appreciation of RAC as an exactly linearizing control technique. 

The work of applies Nonlinear Decoupling Theory (NDT) to provide decoupled 1 inear izat ion of a 

man i pula tor EF with simultaneous pole placement of the linearized EF dynamics. The abstruse formulation of 
this approach has apparently discouraged serious comparison with other approaches, the notable exception being 
!23) where correspondences to RAC and the Computed Torque technique have been noted. The simultaneous pole 
placement and l inear izat ion of EF dynamics represents a blurring of the distinct steps l and 2 described above 
tor the approach [b]. 

In [121-flM, manipulator dynamics are expressed in the task space, or Operational Space of the EF. The 
resulting nonlinear effective end effector dynamics are then linearized by the Computed Torque method. Thus, 
the Operational Space Control (OCS) of fl2]-[l**J can also be viewed as a Generalized Computed Torque 
technique. In ( 1 2 1 correspondences to RAC and the Computed Torque technique have been noted. 

Recently, Geometric Control Theory (GCT) based techniques tor exactly externally linearizing and 
decoupling general af f ine-in-the-input nonlinear systems have been developed (15j-fl9l. These techniques 
provide constructive sufficient conditions for local decoupled external linearization which, if satisfied, 
produces the linearizing feedback law. GCT has been applied to exactly linearizing end effector motions in 
[ 1 9 1 - ( 22 1 . The work of (I9)-{22) also provides a clear and mature control perspective which keeps the 
following steps distinct: l) Exactly linearize and decouple end effector dynamics to a canonical decoupled 

double integrator form, i.e. to Brunovsky Canonical F rm (BCF); 2) Effect a stabilizing loop (pole placement 
step); 3) Pe-form feedforward precompensation to obtain nominal model following perf ormar.ee ; 4) Institute an 
LQR error correcting feedback loop. Unfortunately, to understand the theoretical under p innings of GCT requires 
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an exposure to differential |«OMtry and Lie algebra/Lie group theory which most practicing engineers arc 
unlikely to have. 

It can be shown that all of the above saamingly quite different approaches lead to the mm linearising 
control law for exact external linearisation and decoupling of RF ant ions (24), (This equivalence ia specific 
to the nonlinear system considered bars* vis. system dynamically similar to rigid link aanlpulators. NOT and 
CCT apply to a such larger clasa than this* and so the equivalence to KAC and OSC holds for system restricted 
to this class but not In general). Recognising this equivalence enables us to give a simple necessary and 
sufficient condition for local decoupled external linearisation and to give a sisals fora for the linearising 
control which la applicable to a broad class of so-called natural physical dynaaiical system (25) (26) of which 
a serial link Manipulator is but a special case. For brevity we do not discuss actuated redundant arm - for 
discussion of these cases* see (24) . 

3. Dynamics of Finite Dimensional Natural System. 

Many physical system have finite dimensional nonlinear dynamics of the form (25) (26): 

M(q)q ♦ v<q,4) ■ t? 9 C *"} 4,q C O! 


nxn T 

M(q) C R ; K<q) - M (q) > 0, V q 


where q evolves on a manifold of dimension n. For example q c R n for a Cartesian mnipulator, while q c T° for 
a revolute mnipulator. 

Typically (1) arises as a solution to the Lagrange equations: 


d_ 3_. , 3L 

dt 34 L ~ 3q 


Q 


T 


where L • T-U* T * 1/2 q T M(q)q Is positive definite and autonomous* U Is a conservative potential function* 

Q • t ♦ F are generalised forces* and F are dissipative or constraint forces. This la exactly how mnipulator 
dynamics are obtained and hence mnipulator dynamics are precisely of the form (l). System which arise in 
this way are known.as natural systems (25) [26]. It is known that for natural system not only Is M(q) positive 
definite* but V(q*q) uf (1) has term which depend on M(q) in a very special way f 2 7 1 — f 29 1 . In fact, natural 
systems are nongeneric in the class of all af f ine-in-the-input nonlinear systems (381(39). Although we shall 
only exploit the fact that M(q) is positive definite for any q, it Is worth noting that the nongeneric 
structure of (1) has recently enabled important statements to he made on the existence of time optiml control 
laws [38)-[40], on the existence of globally stable control laws f27]-(33], on the existence of robust 
exponentially stable control laws ( 34 ), and on the existence of stable adaptive control lawa 1351 - 137 ) for 
the natural system (l). 

Recognizing the special properties of the system (l) f It is not surprising that results yielding external 1; 
linearizing behavior can be obtained much more easily than by application of NOT or GOT - theories which apply 
to the whole general class of smooth af f ine-in-the-lnput nonlinear system. 

4. End Effector Kinematics and Control After Linearization. 

The system (l) is assumed to have a read-out aap of either the form 


y * h(q) c R m , *V * y * J(q)q c R®, J(q) » |- cR®*” 


( 2 : 


or of the more general form 


y « h(q) c M*\-V - J Q (q)q c R - . J Q (q) c r”*" 


(3 


where J Q dt - *Vdt is a general* perhaps nonintegrable, Pfaffian form (25) (2b), h(*) is C 2 (44) (47) and defined 
on N n t M* is some m dimensional output manifold, J or J Q is C* , and in general m and n have different values. 
Often h(*) is smooth (i.e. C*) or even a dif feomorphism when the domain is suitably restricted. In subsequent 
discussion^ q will mean that J can be either J or J Q . Let the state of system ( l ) be (q,q). Then for 

y * h(q), 41 • J(q)q will be called the '‘velocity associated with the output y." No te^ that (2) is a special 
case of (3) where -V, the velocity associated with y, is just “V * y and giving J = J * 3h/3q. Also t;ote 

that for the case (3), since h is C 2 , it is still meaningful to talk about y * Jq and J « 3h/3q, J(q): 

TqN 1 * a R n • Th(q)I**i but now the case where *V i y is admitted as a possibility. 
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for rigid link Mnipulators Moving in Euclidean 3-space, typically-^ - (*) c R 6 , whera x c I 3 gives the EF 
location, i tho EF linear velocity, end m t l 3 the EF angular rete of change. It is veil known that « is not 
the tima derivative of any minimal (i.e. 3 dimensional) representation of attitude, so that if » . (*) - J 0 (q>4 •• 
in (3). In this case, ve call J«(q) the "Standard Jacobian." It is also coeon to represent IF attitude by a 
proper orthogonal Matrix A c t 3x *. 


A c SO(3) - A|A t A - AA t - I, detA - +i| , 


where the coIumds of A deternine EF fixed body axes in the usual way. It is veil known that A ■ SA where 
t ht • w x v for all v c K 3 . Thus EF location and kineaatics are often given by 


y - (*,A) - h(q) c R* x S0(3), -V - (*) - J Q (q)q c R 6 A - SX, A C 80(3), tf* - S, J o <q) c R 6 *". (A) 

which should be coMpared to (3). Alternatively, ve can take (cf. (2)) 


y - (J) • h(q) c R 6 . 4 - J(q)4 OCR 3 . 


( 5 ) 


ft C Qc E 3 is a minimal representation of EF sttitude (i.e. of the rotation group S0(3)). In general B - f(A) 
for some function f(*) which la many-to-onw or undefined if the domain of f(<) on S0(3) la not properly 
restricted. That is, because SO(J) esnnot be covered by a single coordinate chart, a la not valid for all 
possible EF orientations and there will be singularity of attitude representation unless ve restrict EF 
sttitude to the region of S0(3) for which a is valid (2%) (*1](42). This restriction then forces A to be 
defined In the image of admissible attitudes, namely In soew QC E 3 . (It May be true, however, that Q » E 3 as 
In the case of Euler-Rodr Iques parameters where singularity of attitude representation corresponds to 
|S|| ■ <■ (42) ) . Typical B's era rol 1-pitch-yaw angles, axis/angle variables, Euler angles, Euler parameters, 
and Euler-Rodr iques parameters (25], (M]-[43). The klnematical relationship between Q and ui Is given by 


Q - (Kfl)w 


( 6 ) 


where fl c R 3 * 3 will lose, rank, i.e. become singular, precisely when B becomes s singular representation 
of EF sttitude. Note from (3)-(6) that 

J * (o n) V 

Generally, the standard Jacobian matrix J Q will become singular only at a manipulator kinematic singularity, 
in which case J will also be singular. Furthermore, J will be singular when Q * fl(q) gives a singularity of 
EF attitude representat ion. This compounds the trajectory planning problem for EF motions, since now we must 
plan trajectories which avoid manipulator kinematic singularities and also ensure that B(q) c Q. 

Henceforth the system (l), (2) or (1), (3) will be said to be exactly externally linearized and decoupled 
it 


4 - u c R 6 . (7) 


This is somewhat of an abuse of notation as a consideration of the system (l), (4) shows. For u 
yields 

x * c R^, u - u 2 C R" 3 , A » uA . 



“V * u 


(8) 


Although EF positional dynamics are decoupled and linearized to x - u, attitude dynamics are nonlinear and 
given by v * U 2 » A * 3A. Eq. (8) is precisely the sense in which RAC can be said to almost "exactly externally 
linearise and decouple" attitude error dynamics as was discussed In the introduction. In the case of the 
system (1), (5),^ ■ gives 

x « Uj t r\ fl ■ Uj c R*, 3 c OCR 3 , (9) 
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which can indeed be aaid to be exactly externally linearized and decoupled. Drawback* to uaing (9) are that 0 
mat always be controlled to resiain in Q* trajectories involving B say be difficult to visualise* and the 
generalised force* U£* which drives B nay be nonlntuitive. On the other hand* it is obvious how to obtain 
stable attitude tracking from (9). The advantage to using (8) is that m and A are easily visualised 
entities* while U 2 is the ordinary torque that we are all familiar with. Fortunately* despite the nonlinear 
attitude dynamics* it is possible to use (8) to per fora EF attitude tracking with asyaptotically vanishing 
attitude error (7] [8]. 

Note that once (8) is obtained, it is easy to get (9) by use of the relationship (6). If we have A » a, 

B - (l(B)tt* and B c Q so that 11*1(8) exists, then use of 


i - u, u - n“ l (o) or - n<B)«) 


( 10 ) 


gives 

b - n<a)« + n(o)<* - u. (in 

Therefore* having (8), we can perforn attitude control directly on u > U 2 » A » 2A or we can transform to ft ■ uj 
and then control. 

5. Comparison of CCT, NDT, and OSC. 

For brevity, we consider the non-redundant manipulator case, taking n • 6 in (1), and we omit derivations. 

A more detailed discussion is given in [24]. 

Note that the system (1), (5) can be written as 
or. taking Z » 

3 Z . A(Z) ♦ B(Z)r, y - H(Z) (13) 




h(q) 


( 12 ) 


where the definitions of A, B* and H are obvious. CCT asks: does there exist (i) a nonlinear feedback 

t ■ Q(Z) + B(Z)u and (ii) a nonlinear change of basis x * X(Z) such that (12) is placed into BCF? : 


h (i) • ( 2 1) (] ; ) • (?)« <■> • «• 

The constructive sufficient conditions of [19j-[22] can be applied and give the following linearizing and 
decoupling feedback law: 


x * -Mj“ l 9Jq ♦ V ♦ Mj" l u 


where 


9J 


Although 3J A J, it is true that 3Jq * Jq giving 


V' 3Ju • 1 
5qT q k • 
.k-l 3 J 


T » -MJ *jq ♦ MJ - *u ♦ V. 


(•5) 


Note that J must be nonsingular for (15) to exist. This is consistent with the theory of [19]-[12] which 
provides sufficient conditions for local linearization. Note also that to implement (15), explicit expressions 
for M, J, and V are required. 
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The NDT approach of [11], constructs the linearising feedback in the following way. For the system (13) 
define 


G(Z) 


■k(fc«4* u) )* 


D*(Z) « C(Z)-B(Z), C*(Z) - C(A)-A(Z). 


(16) 


The use of 


x - -D*-1(Z) C*(Z) ♦ D*"*(Z)u 


(17) 


will transform (12)* (13) to y » u, i.e. to (14). 


It is straightforward to show that, for A, B, and C as in (12) and (13), eq. (17) is precisely eq. (15). Note 
that in (13) we take Z - (?) and not Z » (q lf . . *q n *4n) T * T* 1 * latter choice for Z is taken in (11] and 
serves to obscure the finir' result - namely that (17) and (15) are equivalent. 


Now consider the OSC approach of (12]-[14). In (l) let V * B-G where B is the corlolls forces and G the 
gravity forces. Restrict the domain of the system (l), (5) to ensure that h(*), is a bijectlon (and 
consequently det J(q) 4 0 on this restriction). This restriction means that, as for DCC and NDT, the 
following gives a local result for external 1 inearization. In [12]-(14], the effective EF dynamics are 
determined to be 


A(y)y ♦ C(y,*) ■ F, t « J T F, A • J* T MJ, C • U-P, 


P • J _I C, U - J" T B - AJ$, q - h" l (y), q - J* l y 


08 ) 


Recall that for the system (1), Mq ♦ V ■ x, the Computed Torque technique 
M(q-u) » 0 *> q* » u, since M(q) > 0,Vq, Similarly, In (18) A(y) > 0 for 
restricted domain. Therefore a choice of 


Is to take x * Mu ♦ V, yielding 
every y ■ h(q) where q is on the 


F • A(y)u ♦ C(y,y), x - J T F 


(19) 


in (18) yields A(y) (y-u) ■ 0 «> y - u. In this sense the work in [12]-[14] can be viewed as a Generalized 
Computed Torque technique. From (18) and (19) it is straight forward to determine that x of (19) is exactly 
x of eq. (15). 

6. Derivation of a Feedback Law for Local Exact Decoupled External Linearization and Its Relationship to RAC 
and GCT. 

Recall that the system (1), (7) or (1), (3) is of the form 

M(q) q ♦ V(q,q) * x; q C N°; q, q c R° 

y • h(q) c M®; h( • ) i* C 2 ; V ■ T(q) q e R®; T(q) U C l ; (20) 

M(q) c R 11 ”*; M(q) - M(q) T >0, q c N n 

where in general, it may be that m A n, + R m , -V * y, and J ^ J * 3h/3q. 


It is assumed that a necessary and sufficient condition for h(q) to be onto some neighborhood of y * h(q) 
in M 10 is that the mapping 7(q) be onto R m , i.e. we assume that 7(q) is onto R° if and only if J(q) * 3h(q)/3q 
is onto Th/gjff 8 * R®. This is a reasonable assumption; for example, when tf* * R m , i/ « y * J(q)q*c R m , and 
7 * J » 3h/3q this is trivially true. For the case y » h(q) ■ (x,A) c M 6 ■ R^ x S0(3) and 7 * J 0 where 
if * l ] m J o^^9» the * act that x c T X R^ * r 3 an d A * 3X c T A S0(3) means that tor J Q (q) onto, we can fill out 
a neighborhood of (x,A) and otherwise we cannot. (A general element of T A S0(3) is precisely of the form wX, 
uT c skew'symmetric , so that if u * w(q) can be mapped onto R^, 3X can be mapped onto T A S0(3) [44j[47].) 

Definition LEL : The system (20) can be locally exactly linearized and decoupled (LEL) over an open 

neighborhood B®(y’)C ff® of y* c hfN 11 ) Cff with the arm in the configuration q* c h" l (y') if there is an open 
neighborhood of q * , B n (q*) C N* 1 , such that B ln (y , ) » h(B n (q*)) and if for any u c R® and q c B n ( q 1 ) there exists 


329 


a nonlinear feedback x • F(q,4»u) such that -V, the velocity associated with y - h(q) c B*(y ' )j obeys 4 ■ u. 

Note that for an E F to be LEL at y* it Must be true that y* be in the range of h('), i.e. y* oust be a 

physically attainable IF position. Also for a given EF location, y* e hOP), a Manipulator can physically be in 

only one of the possible configurations h“ l (y*). Thus we can interpret q'c h" l (y‘) to be the actual physical 
configuration of a Manipulator. If the systea (20) is not LEL at y* in the configuration q'c h~ l (y*) it tmy be 
LEL at a different configuration q*e h~ l (y*). 

TheoreM LEL : A necessary and sufficient condition for (20) to be LEL at y* c h (K°) in the configuration 

q* c h-I(y' ) is that J (q*) e R*** 1 be onto, which ia true iff a < n and rank 3cq * ) ■ a. Further*) re,’ the locally 

exactly linearising and decoupling feedback is given by 


x- M(q) £ ♦ V(q,4) 


( 21 ) 


where £ is any solution to 


J(q)C - - 7 (q)$ ♦ u. 


( 22 ) 


When n * n this gives 


X ■ -M(q)J(q) l T(q)q ♦ M(q)J(q)~ l u ♦ V(q,q). 


(23) 


0 


Proof . 
u c R". 


* • 

Necessity: Suppose that V - J(q')q' ♦ T(q*)^’ * u can 

This means that there must exist 2 f c R n such that 


be made to hold regardless of the value of 


J(q')q* * -T(q')V ♦ u. 


(24 ) 


If J ( q * ) is not onto, then Im J(q*)*R m and Im J(q‘) * R ra . Let u be such that - J(q*)q' ♦ u / In J(q'). Then 
there la no q* for which (24) holds, yielding a contradiction^ Sufficiency: By assumption J ( q ' ) Is full rank 
and onto <-> J(q') * 3h(q*)/3q is full rank and onto. Since J and J are C*, there exists neighborhoods B“(y') 
and B n (q'), y* * h(q'), such that B ra (y*) ■ h(B n (q’)) and such that J is full rank and onto when restricted to 
B n ( q ' ) . Now consider any q c B n (q’) and Its associated y * h(^) c B®(y'). Then, -V - J(q)^ ■> 

t . M » 

-V ■ J(q )q ♦ J(q)q. (25) 


Let £ be any solution to (22). £ is guaranteed to exist since Ia J(q) - R“. Take x to be (21), then 


Mq + V * x * ♦ V *> M (q - O ■ 0 ») q « (, 

which with (22) and (25) gives ^ * u. □ 


Comments: 

1) Note that this result applies to all systems of the form (20), of which rigid link manipulators are a 

special case. 

2) Note that with y c and x c R n , the fact that we need m < n can be interpreted to mean that 
there must be least as many inputs as outputs. 


-1 . -1 

3) When J * J * 3h/3q, -V ■ y, and m » n we have that x * -MJ Jq ♦ HJ u + V *> y * u when det J A 0. 
This is the sasw result provided by GCT, NOT, and OCS as seen in the last section. 

*• *« 

4) Note that in the proof we force q * £ precisely like q ■ u is forced to happen in the Computed Torque 
(sethod. In fact, for y - q we have J - I and J * 0 giving £ ■ u. Thus the exact linearizing control of 
(21), (22) is seen to be a generalization of the Computed Torque method in a somewhat different, and perhaps 
more illuminating, way than OCS. 


(*) ■ J °*- 


Let u* consider the case of EF control given by the system (l), (4). Here J * J 0 where ^ - 
In this case, when m • n, (23) Is 


1 - -MJ _1 j a + MJ" 1 !! ♦ V. (26) 

o o o 

When det J 0 * 0, use of x yields (?) - (u^)* Thi * i§ prec*** 1 ^ KAC [7J, [8], Theorem LEL can be Interpreted 
as an extension of RAC to the redundant arte case which allows for the use of a minimal representation of 
EF attitude [24]. The more general case Q < n is given by 


x ■ Mu V, J 0 £ * -J 0 q ♦ u. 


(27) 


By using the indirect form (27), x can be obtained, after £ has been found, by use of the Nevton-Euler 
recursion (45). Furthermore £ c * n obtained recursively - either directly (46), or by first recursively 
obtaining J 0 and J 0 and then solving for £ by Gaussian Elimination. The major point to be drawn here, is that 
(27) shows us how to perform exact external linearization without the need for an explicit manipulator model. 
After exactly linearizing to /?J ■ (^\j one can perform EF tracking at this stage [7] [8], or one can continue 
to the form (ll ) by the use of W (lO). 

When using (26) or (27), the only way that rank J 0 < m can occur for m < n is when the manipulator is at 
a mechanically singular configuration. Recall (section 4) that in the case when a minimal representation of EF 
attitude is used, the resulting Jacobian matrix J will be rank deficient not just for a manipulator 
singularity, but at a configuration which leads to a singularity of attitude presentation. Thus rank 
deficiency of J 0 is kinematically cleaner to understand. The necessity that rank J 0 ^ m in order to use 
(26) or (27) allows two obvious, but important statements to be made: i) For a manipulator with a workspace 

boundary (ignoring joint stops), as In the case of a PUMA-type manipulator, exact linearization at the boundary 
is impossible; il) For a nonredundant (6 dof) manipulator with workspace interior singularities, there cannot 
be exact linearization throughout the workspace interior. For a redundant manipulator with workspace interior 
singular! ties. It may be possible to avoid workspace interior configurations which cannot be exactly linearized 

by the use of self motions as described in [48] [49]. This Is related to the multiplicity of solutions 

available for £ in (27). 

It is interesting to ask just how the control_(23) fulfills the aim of GCT,as stated in (12)-(14). We 

have the nonlinear feedback (taking * y and J » 3) x » Q(Z) ♦ B(Z)u » (V-MJ~ l Jq) ♦ (MJ~ l )u which when 

applied to (12), (13) gives 


« il) ■ (o -r l j) il) * Cr l ) u - 

Consider the local nonlinear change of basis given by 

«)■ ( 55 ”) •«)■(£;”) • 

. •• •• • . 

The fact that y * Jq and y » Jq + Jq gives 

' h (1) ■ (i °) h ® • 

Writing (28) as 





u 


we obtain the BCF 


dt 





<a> y a U. 


Of course we are benefiting from the hindsight provided us by GOT tl5]-(19j. 
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7. Concluding Remarks 


Recognizing the fundamental unity of RAC, GCT, OSC, and NDT [ 7 ] - { 22 ) for exact linearization of 
manipulators, we can focus on their true differences - namely differences in Implementation detail and design 
philosophy. With the awareness that they all produce essentially the same linearizing feedback, we can ask why 
this particular feedback form is appropriate for manipulator-like systems. 

OCS and RAC exploit the specific structure of such systems. Not surprisingly, the solutions arrived at, 
reflecting the philosophies and implementation perspectives of the researchers involved, are quite distinct in 
their flavor and presentation. Yet, since the properties specific to manipulator dynamics ultimately forced 
the solution, they are fundamentally the same. (Actually, apparently only OCS worked with a perspective 
directed specifically towards decoupled EF motions, RAC is content to stop at a point just shy of the goal. It 
is also interesting that [12] apparently shows an awareness of the relationship between OCS and RAC, and the 
degree to which RAC can be said to decouple and linearize EF motions). The important point here is that 
researchers consciously exploited the specific properties of a system of interesC, but without pin-pointing 
precisely what these properties were which made the system amenable to linearizing control. 

GCT and NDT provide techniques for exactly linearizing general smooth af f ine-in-the-input dynamical 
systems. These techniques ignore any specific nongeneric structural properties that a system might have and as 
a consequence the solutions obtained are much less transparent than those of OCS or RAC. The strength of these 
approaches, particularly GCT, is that they can provide necessary and sufficient conditions for a system to be 
exactly linearizabl* and constructive sufficient conditions which produce the linearizing feedback when 
satisfied. These techniques can be applied to systems which defy our abilities to intuit or comprehend - such 
as manipulators coupled to complex electromechanical actuation devices. Interestingly, when applied to the 
problem of manipulator exact linearization the solutions obtained can be shown to be equivalent to those of RAC 
and OCS. Again the structural properties of the system forced the solution. Once a solution is known to 
exist, it is reasonable to attempt to produce it from more physical arguments knowing now that the search is 
not fruitless. This leads to a reexamination of OCS and RAC. 

The work of [ 17]— [22] stresses a perspective which serves to enable a clearer comparison between competing 
techniques for external linearization: Place the system in a standard linear canonical form before additional 

control efforts are made - this ensures that the process of linearizing the system is not mixed up with, and 
confused with, the process of stabilizing and controlling it. This perspective greatly aided the comparison of 
GCT, OCS, RAC, and NDT which resulted in [ 24 ] . In turn, this comparison focuses attention on the structural 
properties of manipulators. 

Much current research makes it apparent that systems dynamically similar to rigid link manipulators have 
important structural properties which can be exploited to achieve results which are quite strong when compared 
to those available for general smooth af f ine-in-the- inputs nonlinear systems [25)-(40j. Here we have seen that 
exploiting the nongeneric second order form of system (1) with an everywhere positive definite mass matrix and 
a 0“ locally onto readout map enables a simple form for the linearizing feedback. 
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1. AJhatracr 


Future robotic manipulators carried by a spacecraft will be required to perform complex 
tasks in space, like repairing satellites. Such applications of robotic manipulators will encounter 
a number of kinematic, dynamic and control problems due to the dynamic coupling between 
the manipulators and the spacecraft. This p a per p re s e n te a, new ana lytical modeling method 
for studying the kinematics and dynamics of manipulators in space. /'The problem is treated by 
introducing the concept of a Virtual Manipulator ( VM). The kinematic and dynamic motions of 
the manipulator, vehicle and payload, can be described relatively easily in terms of the Virtual 
Manipulator movements, which have a fixed base in inertial space at a point called a Virtual 
Ground. It is anticipated that the approach described in this pipsi-will aid in the design and 
development of future space manipulator systems. 


- / 


2, Introduction 


Robotic manipulators are potentially very useful for performing complex tasks in non-industrial hostile environ- 
ments [1,2|, such as in space. A number of studies have considered the potential applications of manipulators is 
space and the capabilities that these systems must have to achieve anticipated mission goals [3-5). These applications 
include tasks such as repairing, servicing and constructing space stations in orbit. Currently, these tasks can only 
be performed by astronaut Extra Vehicular Activity (EVA). Eliminating the need for EVA would obviously reduce 
hazards to the astronauts and mission costs. 


Unfortunately, the use of manipulators in space is complicated by the man ipulator/spacec raft dynamic coupling. 
For example, movements of a manipulator will disturb the attitude of the spacecraft carrying it. This coupling 
will adversely affect the manipulator’s precision, and reduce the on orbit life of the system by consuming excessive 
attitude control fuel. Also, any motions of the spacecraft, say due to the firing of attitute control jets, will disturb 
the manipulator. Therefore, new manipulator concepts, designs and control techniques will be required to minimize 
and compensate for the manipulator/spacecraft dynamic coupling. 

Researchers working on the control of space manipulators have focused their attention on issues such as sen- 
sor reqirements, path planning algorithms, teleoperator control (6-8] ; the problems of vehicle/manipulator dynamic 
interactions remain unresolved. 


This paper presents a new and effective analytical modeling method for studying the kinematics and dynamics of 
manipulators in space. The problem is treated by introducing the concepts of a Virtual Ground (VG) and Virtual 
Manipulator (VM). As discussed below the VG is located at the center of mass of the manipulator/spacecraft system, 
and the VM is an ideal kinematic chain connecting the VG to any point on the real manipulator. Motions of & 
system, including a vehicle, manipulator and payload can be described easily by the VM. This model has proven to 
be effective in calculating the kinematic and dynamic properties of the system; such as its inverse kinematic solution 
and workspaces. This paper shows that the VM approach can also be used*to plan the manipulator’s motions in 
order to minimize the degrading consequences of the manipulator/spacecraft dynamic interactions. 


3, A Model of Manipulators in Space 

Future space manipulator systems will have one or more mechanical arms carried by a vehicle, as shown in 
Figure 1. The vehicle will be capable of motion in six degrees of freedom, and will have reaction jets for position and 
attitude control. Although manipulators could be driven by photovoltaicly powered electric actuators, which use no 
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where 


( 2 ) 


N 

Mm = £ 

7 3 1 

Since there are no external forces, the VG is stationary in the frame N and the vector V § is always constant. 

In the following development the VM properties such as link dimensions and joint axes, for initial manipulator 
configuration axe described. Then the rules for its joint movements as a function of the real manipulator joint 
movements are presented. Referring to Figure 3, which shows the end effector VM for the manipulator shown in 
Figure 2, the i tk link of the Virtual Manipulator is defined by the vector V,*, 

Vx = D! 

V, = Hi+D, 

V N = H*-,+D* (3) 

where 

Di = Ri £ M,/M w (.• =1,2 N) (4) 

and f»i 

H, = U £ M' /Mm ( * =1,2 ft- 1) (6) 

f*l 

The first VM link represents the vehicle’s orientation. This link is attached to the VG by a spherical joint and 
its motion is equal to the three vehicle rotations with respect to inertial space. The end of the Virtual Manipulator 
terminates at the end effector, defined by a vector E, fixed in the N th VM link. 

The i th VM joint is taken as a revolute or a prismatic joint depending upon whether the i tk joint of the real 
manipulator is revolute or prismatic. The axis of rotation for a revolute VM joint, j4, is parallel to the axis of the real 
manipulator joint Similarly, the translational axes of prismatic VM joints are parallel to the corresponding axes 
of the real manipulator prismatic joints. Equations (1) through (5) define the VM and its position corresponding to 
the initial position of the system, as shown in Figure 2. The VM links will all be parallel to the real manipulator 
links in cases where all the centers of mass for all manipulator links lie on a line connecting the manipulator joints 
on the corresponding link. 

The VM will move as the joints of the real manipulator move. The angular rotations of the VM revolute joints, from 
their initial position, are equal to the angular rotations of the corresponding revolute joints for the real manipulator. 
The prismatic virtual joint translations are ratios of the corresponding real prismatic joint translations. For an end 
effector VM, translation of the virtual joint, P,, is given by: 

P i =T i t,MjM M ( 6 ) 

*=i 

For the VM in its position of construction, its initial position, the values of are taken as zero. Hence the initial 
magnitudes of Pj are zero. The prismatic joint motions, T, , are referenced to the initial position. 

If a VM that is constructed according to Equations (1) through (5), moves with the real manipulator according 
to the above description, and its link shapes and lengths remain constant as the manipulator moves, then it can be 
shown (see Appendix A) that: 

1 The axis of the i tK virtual joint is always parallel to the i th axis of the real system joint, and 

2 The Virtual Manipulator end point will always coincide with the real manipulator’s end effector. 


These properties enable the kinematic and dynamic motions of a free-floating manipulator system to be described 
by the motions of a much simpler Virtual Manipulator which has a fixed base in inertial space. The properties of the 
VM remain the same as long as the mass property of the system does not change. For example, when the manipulator 
grasps a free-floating pay load, the VM changes. According to Equations (1) through (5), the VM link lengths will 
be reduced for the addition of a payload. Virtual Manipulators constructed for points other than the end effector 
have different links than the links defined in Equation (1); and their joint movements maybe different than the ones 
described above, for example, prismatic joint translations may be the vector (P< - T<), depending upon location of 
the point used to construct the VM [9|. 
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reaction fuel, manipulator motions could disturb a vehicle's position and attitude and result in the consumption of 
excessive amounts of attitude fuel. The useful ! ! fe of spacecraft systems is often limited by the amount of reaction 
jet fuel they can carry. * 

Two approaches to solve this problem are: 1) permit the vehicle to move and compensate for the base motions in 
the manipulator task planning; and 2) plan the manipulator motions so that they do not cause the vehicle to move 
excessively. The first of these approaches requires the ability to perform inverse kinematic and workspace calculations 
for a free-floating system [10|. The second requires methods for planning manipulator motions that would self-correct 
the vehicle's orientation with little or no reaction jet adjustments. These approaches and associated issues are 
addressed here through the Virtual Manipulator technique. Assumed in this work is that the external forces/torques 
acting on the system are negligible, and that the system is free floating. Also assumed is that the system elements 
may be modeled as rigid bodies. The later assumption may not be valid if a manipulator must perform high speed 
motions. 

4* Analytical Development of the Virtual Manipulator 

The Virtual Manipulator (VM) is a massless kinematic chain terminating at an arbitrary point on the real 
manipulator. Its base is the Virtual Ground (VG), which is an imaginary fixed point in inertial space. It is proven 
below that for a given system the properties of the VM and location of the VG are fixed. VMs exist for many 
different manipulator structures, such as open or closed chains, single or many branch arms, revolute or prismatic 
joints [9-1 ij. The discussion in this paper will be limited to manipulators composed of spatial serial chains with 
revolute or prismatic connections. Although VMs exist for any point on the real manipulator, this paper deals with 
VMs whose end points coincide with the real manipulator end efTector. 

The VG is defined to be the center of mass of the manipulator system. From elementry mechanics, when there 
are no external forces, such as from reaction jets, the VG will be fixed in an inertial space. It will not move due to 
any internal forces of the system such as joint torques, or due to any manipulator motions. 

Figure 2 shows a schematic drawing of an N body spatial manipulator system. The first body in the chain 
represents the vehicle which carries the manipulator. The N th body is a combination of the payload and the last 
link. The joint is called Ji , and is the center of mass of the i th body. The vectors R* and L, connect C% to 
Ji and Ji to C, + i, respectively. The vector Ryv connects C * to the end effector. The vectors R* and Lj_| are fixed 
relative to the i th link, and hence the angle between these vectors is constant for all system configurations. If the t 1 * 
manipulator joint is a revolute joint, the vector defining the axis of rotation of J, is called A,, and the angle £, is the 
rotation of the t th joint. If the i th manipulator joint is a prismatic joint, the vector T < is defined to be the translation 
along the translational axis. If the i tH joint is revolute, then the magnitude T, is equal to zero. 




The location of the VG for this system in inertial space, the center of mass of the system, can be found by knowing 
some initial position of the system. The vector S(0) defines the initial known location of the end effector with respect 
to an inertial reference frame N. Then the location of the VG, the vector V tf , can be obtained from conservation of 
linear momentum by: 

N i-l 

V, = D s (°) - B R > + L > + T,)| MJM," (1) 

.=1 ;=l 
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Figure 3: N Body System and iU VM. 



Figure 4: A Three Body Planar System and 
iU VM. 


Table 1 gives the properties of a very simple planar manipulator and its Virtual Manipulator, shown in Figure 4. 
It should be remembered that the method is not restricted to planar systems. 


5. Applications of Virtual Manipulators 

The Virtual Manipulator approach has a number of possible applications. VMs can be used to simplify the 
inverse kinematics of space manipulators, calculate their workspaces, plan their motions and formulate the equations 
of motion [9-11]. It should be noted that ut’ng conventional methods, these problems are far more difficult for space 
manipulators than for industrial manipulators with fixed bases. In the sections below, the use of the VM is shown 
for workspace analysis and path planning. 

A. Workspace Analysis 

Since the vehicle and manipulator'dynamics are coupled, the manipulator’s motions will cause the vehicle to move 
and this in turn makes it difficult to find the manipulator workspace. In fact several different types of workspaces 
exist. In this section, a workspace called the constrained workspace, for a manipulator in space is defined, for a more 
complete discussion of space manipulator workspaces refer to references [9,10], For the constrained workspace it is 
assumed that the attitude, but not the location, of the vehicle is controlled. This can be achieved without the use of 
attitude control fuel by employing reaction wheels, or by using the self correcting maneuvers discussed later in this 
paper. 

To find the constrained workspace, a Virtual Manipulator is constructed to the end effector of the real manipu- 
lator. The joint limits of the real manipulator are transformed into VM joint limits. The workspace of the Virtual 
Manipulator is then found using conventional workspace analysis methods (12|. The real manipulator workspace will 
be equal to the VM workspace because of the following reasons. The VM end point coincides with the real manip- 
ulator end effector, and it is assumed that it is possible to control the orientation of the Brst VM link, representing 
the vehicle, with respect to inertial space. The other joints are controlled with their actuators. This workspace will 
always be a spherical shell, assuming there are no limits on the vehicle orientations. Figure 5 shows the constrained 
workspace for the simple two link manipulator shown in Figure 4, it was found using its VM. 

B. Path Planning 

In certain cases, the magnitude of the rotations of the vehicle caused by the manipulator’s motion may not be 
acceptable. For example, vehicle rotations may cause communication devices to loose their signals. Vehicle rotations 
can be controlled using reaction wheels or reaction jets. However, these devices have the disadvantages of increased 
mechanical complexity and system weight or increased consumption of attitude control fuel. 

It is shown below that the manipulator itself can be moved in such a way as to have the end effector follow a 
nominal specified path and yet have a prescribed vehicle orientation, with specified limits, without using attitude 
control fuel or requiring reaction wheels. 


338 



Tkble 1: Characteristics of Planar Manipulator 
and ita VM. 
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Figure 5: Constrained Workspace of Manipulator 
Shown in Figure 4. 

To find this motion the principle of conservation of angular momentum is applied to the system. For an n degrees 
of freedom space manipulator the following equation can be writter. 

X = P(e,X)A, (7) 

F: 3 by n matrix with elements 

X: 3 by 1 vector of vehicle inertial orientations with elements, X,, 

0 : n by 1 vector of joint angles with elements #, 


In general, Equation (7) is non-integratable, that is: 


d0 k d9; 


d*x t 

dijdiu 


Therefore, the final vehicle orientation depends on path taken by the manipulator from one position to another. It 
follows that the final vehicle orientation will change if the manipulator moves along one path in joint space sad 
returns to its initial position by another path. This is a similar notion to the one which permits astronauts to reorient 
their bodies by moving their limbs (13). This leads to a strategy for adjusting or correcting motions of the vehicle's 
orientation. In this strategy nominal trajectories are selected for the end effector and vehicle orientation. Then the 
joint motions are executed assuming the vehicle follows its trajectory. If at any point the vehicle orientation deviates 
from its desired path by more than a specific amount, a series of small cyclic motions, selected to correct for the 
vehicle orientation are added to the joint motions. 

To find the cyclical joint motions that achieve the desired vehicle/ baae orientation corrections, it is assumed that 
these motions are small enough that the end effector deviates only by a small amount from it# nominal trajectory. 
This small motion assumption permits the use of a nonlinear system model in which nonlinearities of order greater 
than 2 can be neglected. 

First, let X be a set of Euler angles defining the baae orientation with respect to an inertial coordinate frame. 
The initial and desired final baae orientations are X« and X#, respectively. The desired change in the Euler angles is 
defined by 

SX = X, - X d (9) 

Let 0o be the vector defining the initial and final joint positions at the beginning and end of the correction 
maneuver. Also let the vectors 6V and £W define small joint movements. The cloeed correction path is constructed 
by having the manipulator move along the straight lines, in joint space defined by vectors £V and 6W shown in 
Figure 6. 

For small 5V and 6 W the following equation can be obtained from Equation (7). 


3 3 3 

1 ; = t m= 1 




(*= 1.2.3) 


( 10 ) 




where 6X it 5VJ and BWi are elements of the vectors 6X t 6V and $W, respectively. In the case of a three DOF spatial 
manipulator, Equation (10) will yield three equations with six unknowns. Three additional constrain equations are 
required to solve for SV and S W. 

If vectors 6V and 6 W are parallel, the cyclic motion will not produce any vehicle rotation. Therefore it is assumed 
that these vectors are perpendicular: 

6V t . *W = 0. (II) 

Further, the magnitudes of 6W and 6V are assumed to be equal: 

6\ t • 6\ = 6 W r • $W, (13) 

and one of the elements of is chosen to be a linear combination of the other two. For example, 

*V, = (*V, + *Vi)/2 (13) 

Equations (10) through (13) yield six scalar equations with six scalar unknowns, which can be solved for the 
desired joint trsjectories, 6V and 6W. If the required correction, 6X, is large, the values of 6V and 6W may violate 
the small joint motion assumption. In this case the desired correction can be achieved by a series of m cyclical 
correction maneuvers. It is shown below that at each cycle Equations (10) through (13) do not have to be resolved 
and the final position can be achieved. 

Referring to Figure 7, T(X ; ) is a 3 by 3 matrix which transforms a vector expressed in vehicle body coordinates 
(x,y,i) into inertial or Newtonian coordinates (S 9t S 9t N M ) t when the body is at jth orientation. The transformation 
matrix for the initial vehicle orientation is T(X*). The transformation matrix for tbs desired vehicle position to be 
achieved after m cycles is T(X^). After one correction cycle, the transformation matrix ia T(X* + 6x) t where, 

T(X. + *x) = T(X,)A, (14) 

and the matrix A is the transformation matrix from the vehicle position, one cycle from the initial vehicle position, 
back to the initial position. The A matrix will not change with each cycle because the total system, vehicle asd 
manipulator, have been subject only to a rigid body rotation in inertial space. Hence after m cycles the transformation 
matrix from the desired system position to inertial coordinates is simply: 

T(X^) = T(Xt)A m (15) 

Equation (15) can be solved for A: 

A = PA , /’ n p-' (1«) 

where A is a diagonal matrix of the eigenvalues of T(X, ) ” 'T(X^) and P is a matrix of corresponding eigenvectors. 

Using the A matrix obtained from Equation (16), the change in Euler angles ($x) are calculated from Equa- 
tion (14). Then the joint correction motions for each cycle, <5V and £W, are obtained by solving Equations (10) 
through (13) The manipulator should go through the derived joint transformations (6V, <5 W ) m times to approach 
the desired vehicle orientation. However, the final vehicle orientation after m cycles will usually be slightly different 
than the desired orientation because of the neglected higher order nonlinearities. In order to achieve the desired 
vehicle orientation more precisely, the over all correction may need to be broken into several smaller corrections and 
the process repeated with a slightly different set of 6V and 8 W for each subcorrection. 




Figure 6: A Closed Path Correction in Joint Space Figure 7: Vehicle Coordinate Rotation Due to Cyclic 
for a Vehicle Rotation. Manipulator Motion. 


3^0 



The above technique is now demonstrated for a spatial 3 DOF space manipulator shown in Figure 8. The 
properties of the manipulator are given in Table 2. It is desired to rotate the vehicle from its initial orientation to 
its Snal orientation as shewn in Thble 3. hi this example, it was necessary to solve for the joint trajectories (3V and 
IW) 3 times to precisely obtain the desired vehicle orientation. The joint tr^ectoriss for these 3 cycle sets are shown 
in Figures 0 through II. Bach cycle is repeated 30 times to achieve the desired vehicle orientation. Thble 3 shows the 
system angles after each cycle set. During each cycle the vehicle os cill a t es in sympathy to the manipulator's motion, 
see Figure 12. However the mean orientation of the vehicle changes continuously and reaches at the same time 
the joints return to their initial positions. Figure 13 shows the mean vehicle orientation during the joint cycles. The 
vehicle movements during the joint motions are ± 0.1 radians from their mean trajectory. Here one can clearly see 
change in the base orientation as the manipulator joints cycle through their motion. 
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Figure S: Spatial 3 DOF Space Manipulator. 

1.25 r • » • 


1.00 y 



n -*< l * i ■ * * < «'■ * 

T).X 0.25 0.50 0.75 1.00 

Cyc*a Langtti (nondtonansionan 

Figure 10: Joint Angle Trajectories for Second 
Set of Cycles. 



Figure 9: Joint Angle Trajectories for First Set 


of Cycles. 



lhble 3: Manipulator Angles at Different Instances. 
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Table 2: Three DOF Manipulator Characteristics. 
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Figure 12: The Xj Vehicle Coordin.te for the Fi « UM 13: Me «» Vehiel « Eulw During 

First Set of Cycles. Correction Cycle. 

0. Summary and Conclusion 

In this paper, the concepts of Virtual Manipulators and Virtual Grounds are discussed. The end effector VM 
characteristics and proof of its properties for serial link with revolute and prismatic joints were presented, and some 
of its applications were discussed. This is a new concept and further research is required to demonstrate its full 
capabilities. 
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Appendix At Proof of Virtual Manipulator Properties 

First it will be proven that for a VM constructed using the rules presented in section 4 of this paper the VM end 
point will coincide with the end effector. Then it will be proven that when the manipulator goes through a movement 
the VM joint motions described in section 4 will keep the VM end point on the end effector. 

First, recognising that the system center of mass is stationary in the inertial frame N, V # remains stationary in 
this frame and referring to Figure 2 yields: 


N-l 

M, 0 ,V, = M n 8 + - L s-i - Tw-i - R*-,| + . . . + M,(S - £ (U + T< + R,)| (Al) 

Recall that if the joint is revolute T< = 0, otherwise, its magnitude is equal to the prismatic joint translations 
from the initial manipulator configuration and its direction is along the translational axis. Equation (Al) can be 
solved for S(t) as follows, 

S W . V, + + *, + 1.) + . . . + «■ + »»♦■ +«»-' ( R„_, +Tj,_i + Lff-i) (AS) 

Mtoi Mtot 

Equation (A2) can be written in terms of the vectors D # , H, and P. by using Equations (4) through (6), to yield: 

S(t) = V, + (Di + Hi + P t ) + . . . + (Dur-t + H*-i + Pjv-i) (A3) 

Using Equation set (3) and the fact that the end effector position is always equal to S(t) + R/v gives: 


E(f) = V, + V! + Pj + . . . 4- Ptf-i + V* (A4) 

It should be noted that this equation does not depend upon the existance of the Virtual Manipulator. The vector 
chain represented by Equation (A4) describes the end effector position relative to the N reference frame for all time. 

For the initial manipulator position the VM constucted according to the procedure outlined in section 4 has an 
end point described by the following vector chain, 

V, + V 1 + ... + V* (A5) 

Comparing Equations (A4) and (A5) it follows that in the initial position, when the P/a are equal to zero, the end 
effector coincides with the VM end point. 

Now it will be proven that as the real manipulator moves the VM joint motions described in section 4 will keep 
the VM end point on the real end effector. Say the manipulator goes through some joint movement, from section 4, 
the following vector chain describes the VM end point, where the P,’s are no longer zero, 

v; + v; + p; + . . . + p*_ t + (A 6) 

In the following paragraphs it will be proven that the vectors V?, V * and P* in Equation (A6) are the same as V*, 
V, and P f in Equation (A4), respectively, therefore, the VM end point will coincide with the real end effector. The 
vector V # is always constant, therefore, 

v ;=v, 
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(A7) 


The initial real manipulator links are composed of vectors Lj.j + R, and since the manipulator links are rigid, the 
magnitude of the vectors L,-j and R< and the angles between them are always constant. Since the magnitudes of 
and R< are constant, then from Equations (4) and (5) the magnitudes of and D< will also be constants. It 
can also be seen that the angles between H«-i and D, are constant. Then 

|V,*| = |H*-i + D«| VI. ■ (A8) 

The Virtual Manipulator links are composed of the Vf vectors. These links don't change their shapes and lengths 
as a function of time and since magnitudes of Vf are initially equal to magnitudes of V*, and magnitudes of V, do 
not change with time it follows that: 

|V;| = |H,_ 1 + D < | = |V < | Vt,.’ (A9) 

The magnitude of the vectors Pj in Equation (A4) and the PJ vectors in Equation (A6) are both obtained from 
the real manipulator prismatic joint translations, using Equation (6), therefore by definition, 

|p;i = |P<l *,.• (aio) 

Now it will be proven that the direction of the vectors Vf and P t * in Equation (A6) are parallel to vectors V,* and 
P< in Equation (A4), respectively. 

First it can be established that the rotations of the first VM link are set equal to the vehicle rotations and hence 
the first VM link will always be parallel to the vehicle. Therefore, 

V;=V! Vt (All) 

Since axis of rotation or translation of the first real and virtual joints are fixed relative to their corresponding first 
links, and the rotations of the first VM link is the same as the vehicle, and these axes are initially constructed to be 
parallel, then they will always be parallel. 

Now consider the case when the first real manipulator joint is revolute. The elements of the second VM link H{ 
and DJ will be parallel to L| and Rj and in turn the vectors VJ and Vj will be parallel and 

VJ = V, Vt (A12) 

because the rotational axis of the first VM and real manipulator joints are parallel, as shown above, and the magnitude 

of their rotations are equal by construction. 

In the cases where the first joint is prismatic, the elements of the second VM link HJ and D$ will be parallel to 
L| and R* and in turn the vectors VJ and V 3 will be parallel and 


VJ = V* Vt (A13) 

because the VM and real manipulator translational axis for this joint are parallel. In the same manner it is possible 
to show that 

v; = v< vt,,- (a 14) 

Also, in a similar manner all the translational axis of the real manipulator and the VM are always parallel, and from 
Equation (All), 

P*=P< Vt,,- (A15) 

Substituting Equations (A15), (A14) and (A7) into (A6), and comparing the result with Equation (A4) shows that 
the end effector will always coincide with the VM end point, and this completes the proof. 
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I • Aba tract 

A nodal reduction net hod for discrete bilinear systems Is developed which matches q sets of 
Vol terra and covariance parameters* These parameters are shown to represent both deterministic 
and stochastic attributes of the discrete bilinear system. A reduced order model which matches 
these q sets of parameters Is defined to be a q-Vol terra cova riance equivalent ^realisation (q- 
Vol terra COVER). An algorithm la presented which constructs a class of q-Vol terra COVERS 
parameterized by solutions to a Hermit lan, quadratic, matrix equation. The algorithm is applied 
to a bilinear model of a robot manipulator. 


2. Inroductlon 


While model reduction of linear systems has been extensively researched over the past few years, little work 
has been done in the area of model reduction for nonlinear systems. One class of nonlinear systems which is 
especially appealing are bilinear systems ([l]-[4]). Bilinear systems are linear In the state variables, linear 
In the control variables, but nonlinear In the state and control. One reason that this class Is of interest is 
Chat nonlinear systems which are linear In the control variables can be accurately approximated by bilinear 
models <[5],(6)). Bilinear approximations will in general have a higher order than the original nonlinear system 
and effective means for reducing bilinear models are needed. 

Most approaches to model reduction of linear systems have strived to preserve or approximate a certain 
characterizing property of the full order model. For deterministic systems this property la typically the 
Impulse response sequence or the system Hankel matrix (e.g«, (7]- [101). Model reduction of linear stochastic 
systems usually Involves the output covariance sequence or the corresponding Hankel matrix (e.g., [11] and [12]). 
A model reduction technique which considers both deterministic and stochastic properties has also been developed 
(1131— [15]) and the resulting reduced order models have been called q Markov COVERS ( cova riance equivalent 
£eallzatlons) . The model reduction problem for discrete bilinear systems has recantly received some attention. 
Hsu et al. [16] develop a method for deterministic, discrete, bilinear systems using a generalized Hankel matrix. 
Desal has proposed an approach to stochastic model reduction based on his realization theory ([17]). 

In this paper we develop a model reduction algorithm analogous to the q Harkov covariance equivalent 
realization approach for linear systems. The algorithm produces a class of reduced order models which exactly 
match a specified number of deterministic and stochastic parameters. This class of reduced order models is 
parameterized by the solutions to a Hermit len, quadratic, matrix equation. Section 3 presents the deterministic 
and stochastic attributes of a bilinear system which we will preserve In our method and defines a q- Volterra 
covarianc e equivalent realization . Next, in section 4 the model reduction algorithm la outlined. In section 5 a 
parameterization of reduced order models which match q Volterra parameters and q covariance parameters is 
formulated. Section 6 contains an application of the proposed algorithm to a two degree of freedom robot 
manipulator. The final sections are our concluding remarks, acknowledgements and references. 


3 . q-Vol terra Covariance Equivalent Realization s 

Consider the time Invariant discrete bilinear system 

n u 

x(k+l) - Ax(k) + Z (N x(k) + b.)u.(k) 
l-l 1 11 

y(k) - Cx(k) (I) 

where A and N 1 , l-l,...,n u are matrices, b^, 1-1,..., n u are n x *l matrices and C is an n y *n x matrix. The 

9tate vector x(.) Is n^*!, the Inputs Uj(.) 9 l-l,.*.,n u are scalar, zero mean. Independent Guassian white noise 
processes with Eu^JJu^k) - 6^ and for j > k, Ex(k)u^(j) - 0. The output y(.) is an n y *l, zero mean, 

stationary stochastic process. We assume that the bilinear system driven by unit Intensity Guassian white noise 
is stable in the sense that the state covariance 
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X 4 llaEx(k)x*(k) > 0 <2) 

k«- 

Is finite. It can be shown that for these Input processes the state covariance will satisfy the bilinear Liapunov 
equation 

n 

n u * a a 

X - AXA + r » i XH 1 + BB , B M bj ... % 1 . (3) 

1*1 1 u 

tfa also assume that there are no redundant Inputs or out put a (B has linearly Independent coluans sad C has 
linearly independent rows). 


Hubert 1 et el. [3] define the product * for an s*l vector a and an r*l vector b and the product [] for an 
n*nr matrix L and an n*a matrix H by 


« * bi 


•b, 


ab. 


, L □ M * [ Lj ... L f ) □ M S ( LjM ... L^H J . 


They also establish the following Identity, 

L( (Ms * b)] - (L □ M)(a * b) 


(4) 


With these definitions (1) becomes 


x(k+l) - Ax(k) ♦ N[x(k) * u(k) ] ♦ Bu(k) 
y(k) - Cx(k) 


and (3) may be expressed as 


AXA 


* + (NDX)M* + BB* , N $ [ Nj ... N 0 J . 


(5) 


(6) 


The zero initial state response of the bilinear system (5) Is an Infinite Volterra series [4], This series 
in regular form is found to be 


y(k) 


^ 1 -1 ^ k-lj £ -i 1-1 

£ CA 1 Bufk-ip + t £ CA 2 NQA 1 BMk-^-i^Mk-lj)! + 


V‘ 


V 1 V 1 


k k-i 3 k-ij-lj t , 


1,-1 


1,-1 


t t £ CA 3 NDa 2 NQa 1 l|»(W.-l,-l,)Mk-l,-i 1 )'i(W,)] ♦ ... 

1,-1 1,-1 1,-1 12 3 2 3 3 


where Identity (4) has been used repeatedly. The matrix valued function In each of the summations Is called a 
Volterra kernel, the J ch Volterra kernel in regular form Is then 

1,-1 


A *A~ k * 1 . 1 - * 1 1 “1 

W 1 !-!””' 1 !) “ CA J MDA J f, *'* NDA B 


(7) 


where * B >1, wl J and the matrix N occurs exactly J-l times. The step response, u(k)-l n for all k*0, la 

u 


y(k) 


k-i. 


j-i ij-i ij.j-i 


• • «“lj 

l 

V 1 






where 1 Is a column vector of ones with a elements. We shall call the coefficients in the step response the 

Volterra parameters of the bilinear system (5). The Volterra parameters of the j tl1 Volterra ternel are n^*n^ 

matrices. We define the set of q th order Volterra parameters as those coefficients in which the matrices A and N 
occur a total of q times. For example, 

v 2 - { CA 2 B , CANQB , cnOab , cnQnOb } . 

We now see that the step response is completely characterized by the sets of Volterra parameters. We also observe 
that for each k a new set of Volterra parameters effects this response. That la, if a reduced order model matches 
the first q sets of Volterra parameters of the full order model then it will also match the step response for 
k-0 , 1 q+1. 


In addition to Volterra parameters we are concerned with a covariance sequence for the bilinear system. 
Desai [17] anj Frazho [18] utilize a covariance sequence which includes both second moments of the output and 
higher moments between the output and input processes in their bilinear stochastic realization theories. We also 
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use this type of sequence, In particular tha aequanca of concern la 

Rq( 0) - Ey(k)y*(k) - CXC* 

R,(l) - Ey(k+l)y*(k) - CAXC* 

R,(0,0) - Ey(k*-l)ly(k) • u(k)l* - CNQXC* 

* 2 (2) - Ey(k+2)y*(k) - CA 2 XC* 

* 2 (0,1) - Ey(k*2)|y(k) • u(k.l))* - CNOAXC* 

R 2 (1,0) - Ey(k+2)[y(k) • u(k)l* - CANdXC* 

R 2 (0,0,0) - Ey(k+2)[y(k) • u(k) • u(k+l)l* - CNQnOXC* 

• a • 

• • a 

a a a 

where tha aubacrlpt Indicates tha total number of occurrences of A and N, and tha integer* in parenthesis 
represent the powers of A from left to right. A typical element of the sequence la then 

1 1 > 

A Ey(k+j-l+i J +i J _ 1 +...+l l )|y(k)*u(k+l l )*u(k+l+l 2 -t-l l )*...*o(k+J-2+i j _ l +...+i l )l* 

l 1 1 i-i *i • 

- CA j MQA j N...NQA XC . (8) 

A. with ch« Volcerra parameter* we ahall define Che act of q c>1 order covariance paraaecera, 1 , as thoaa 

9 

covariances In which the matrices A and N occur a total of q times, that la the set of second order covariance 
parameters is 

r 2 - { CA 2 XC* , cnQaxc* , CANC]XC* , CNONQXC* } . 

These sets of covariance parameters completely characterise the stochastic bilinear system. It is worth noting 
that If a reduced order model matches the first q sets of covariance parameters of the full order model It will 
also match exactly the mean square value of the output and all output and input correlations up to q steps in 
Cl... 

Consider now a reduced-order bilinear model 

x R (k+l) - A R x R (k) ♦ N R (x R (k) • u(k) ) + B R u(k) 

y R (k) - C R x R (k) (9) 

where X R (.) is an n r *l vector, n r < , y R (.) la an n y *l vector, and A^ N R , B R , C R are matrices of appropriate 

dimensions. In addition, we assume that the state covariance X of the reduced model driven by zero ^ean 
Cuasaian white noise Is the unique positive definite solution to 

X R - Wr ♦ < M R° X R>\ ♦ B R B R • (,0> 

We now define a particular type of reduced order model for discrete bilinear systems. 

Definition : The reduced order model (9), with state covariance X R satisfying (10) is a q- Vol ter ra COVarlance 

Equivalent Realization (q-Volterra C0VER)of the bilinear system (5) whenever 

V R - V 1# i-0,1 q-1 

and 

- R t , i-0,1 q-L 

where V and R_ denote the sets of l C ^ order Vol terra and covariance parameters of the reduced order model, 

R 1 R i 

respectively. 

An algorithm which constructs the q-Volterra COVERS of a full order model is our main objective. One such 
algorithm Is presented next. 
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4 • A Model Reduction Algor ithm 

Suppose that a full order model (5) and a state covariance satisfying (6) are given. The q ch obaervablllt; 
matrix ( ( 3 1 , (4 J , (16] ) of this model is 


0 - A 


Oo 


Ql.,A 

«1 


Qt-i", 

• 

• 

. % • c • * 

• 

• 

« 


• 

Vi 


«l-l N n 

u 


, 1-1,..., q-1 


(11 


The matrix partitions have dimension (n^l) 1 l n^xn x , i-0,1 , ...,q-l . We observe that the matrices Q^B an 
Q^XC* contain the same information as the sets and , respectively. Using the full order model we construe 
t!«e following matrices 


A a 

D - 0X0 fl 
q <1 9 


(12 

(13 


o £ o (axa*+(hQx)n*)o* - 0 [ a (NDX 1/2 ) ]( A (NQX 1/2 ) )V . 

q q q q 9 

As a consequence of the quadratic form and using the Liapunov equation (6) it Immediately follows tbit the ran$ 

spaces of these matrices are 

R( 0 ) - R( [ A X 1/2 (NQX 1/2 ) B 1) , R(D > - R( [ AX l/2 (NQX I/2 ) )) (K 

9 9 

and It Is obvious that R(0 ) la contained in R(0 ). 

q q 


We now compute a full rank factorization of 0 


D - PAP 

q 


(l! 


where rank(D ) * r < • n^* By virtue of the full rank factorization the columns of P form a basis for the ranj 
space of D . Introducing P*, the Moore-Penroae inverse of P, then it is well known that PP* is an orthogotu 
projector onto the range of D ([19)). We now partition P into blocks whose row dimensions are compatible wit 

N. 


P - 


q-i 


Pi - 


i-l , 


• »q~i 


(i 


and define new matrices 


q-1 


J-l, 


.,n . F - [ P A P, 


A N, 


P ^ 


P \ I 

Vi I 


The matrix G is (n +1)^ *n '(n + l)r and It must be determined such that 
u y u 

0 - hXp* , A - diag(A) , . 

n +1 
u 

where 1 is a block diagonal matrix with n +[ blocks. Given these constructions we now state our main result. 


Theorem 1: Given a discrete bilinear system {A,N,B,C,X| and a matrix G in (17) such chat (18) is satisfied tl 

A 


the reduced order model 1 A ,N , B 0 ,C ,X y 1 of order n defined by 

R K K K K » 


1 \ \ 1 ' * B R P V • C R “ P 0 • X R 


A , n - r 
r 


where r, P, A are from the full rank decomposition of 0 (15), Pq is from the partition of P (16), and 

satisfies (18), is a q-Volterra COVtK. ** 

Proof: First we will show that P is the q th observability matrix of the reduced order model (19). Using 
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d.coapo.ltlon. (IS), (18) and Che rang* apaca daaerlptlona (14) wa find Chat 7 la In Che range apace of P ao that 
(19) leada to 


P( * R M R ] - P 

which implies thsc th« partitions of P have the required structure (11) 

p t-i A » 


C R • P 1 ' 


p i-i"r 


P 1-1 N R 


• i»l,ee.,q-l . 


To show that the reduced order model satisfies the bilinear Liapunov equation we first substitute (19) into (10) 
which leads to 

A - P*pXF* P* + P + 0 BB*0*P + • 

9 <1 

Using (12). (13), (IS), (18), and by pre and post multiply by P and P , respectively, we have 

* * • 

* + * * + * + *•+* + * * + a 

0 XO • PP 0 AXA 0 P P > PP 0 (NQX)N 0 P P t PP 0 BB 0 P P . 

qqqq q q qq 


Now using the projection property of PP + we find that 


0 (X - AXA* ♦ (NQX)N* ♦ BB*)0* 

q q 

which is known to be satisfied (6). To show that the model (19) matches Volterra parameters we again use the 
projection property 

°„ B « ■ pp \ B - 01 

q R * q q 

and the matching of covariance parameters follows directly from (12) ,(13) 

°„ XdO* * PAP * * 0X0* . * 

<I R R q R q q 

Our remaining Cask is to determine the unknown matrix C In (17) in order to satisfy (18). This Is Che topic 
of the next section. 

5 • Parameterization of q-Vol terra COVERS 

To obtain a characterization of the matrix G we first examine the structure of the matrices IT and F* We 
observe that T> ran be partitioned as ^ 


V> “q 

d* 3T 

q qq 

and that the partitioned fora of the constraint (18) leads to the three relations 


(20) 


FlF 


_ * — 


- n . , PAG - d , GAG - d 

q-i * q 


qq 


( 21 ) 


The first relation is satisfied by virtue of the constuction of of F (17). It is easily seen that d ts 

* ^ 

contained in the range space of F so that the second relation Is consistent and G may be expressed as ((19)) 


G* - A '(F^J * (I - F + F)Y) ( 22 ) 

q 

where Y is an unknown matrix with dimension (n^+l )r*( n^>l )^ Substituting for G in the last relation we find 

that Y must satisfy the Hermitian, quadratic, matrix equation 

Y*KY ♦ L*Y > Y*L «■ M - 0 (23) 

* 1 (X - F + F)T l (l - F + F) - K* , L • (I - F + F)T l F + d , M - d*V - 2 - M* . (24) 

q q q qq 

By Inspection we sec chat the matrix K is nonnegative definite, and that the columns of the aatrix L are 
contained In the rany.e space of K. Based on these observations we now state a theorem which is motivated by a 
result of Crone [20], 
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Thgonta 2: Let K b« in m*n nonnagatlve definite matrix with rank t, L in m*n matrix whoa* coltmne ere contains* 

In the range space of K and M an n* n Hermit lan matrix. Then the matrix equation 

Y*KY ♦ L # Y ♦ Y*L ♦ M ■ 0 (25) 


hae e solution If and only If 

L*K*L - H > 0 and rankaVt - M) - a < t * rank(K) . (26! 

When these conditions hold Y Is a solution If and only If It has the form 

Y • K* /2 <VE 1/2 U* - K* /2 L) + (I - K +/2 K l/2 )Y (27! 

where K 1 ^ 2 Is the unique nonnegative definite square root of K and K^ 2 Is the Moore-Penrose Inverse of K 1 ^ 2 . It* 
matrix V Is an nt*s matrix, E Is a*a and U Is n*s and they must satisfy 

V*V • I , R(V) If conealn.d tn R(K) , U*U • I , I > 0 , UIU* • L*K*L - M . 


7 Is an arbitrary m*n matrix. 

Proof : It is well known that If JC • Wik is a full rank singular value decomposition (SVD), then 

K i/ 2 „ W n l/ 2 w* , k +/2 - wn +/ 2 w* 

and it follows that K, K*^ 2 , K + ^ 2 all have the same range space which Is spanned by the columns of W, an s* 
column unitary matrix. By the hypothesis that Che columns of L are In the range space of K, equation (25) 1: 
satisfied If and only If 

(K l/2 Y + K +/2 LM»C I/2 Y + 1C^ /2 L) - lVl - M 

* -f 

which Is consistent If and only If U L-M > 0. All matrix factors of this relation are 

k i/2 y * k +/2 l - vr l/ 2 u* 

* * y 

where U"U Is the full rank SVD of L K L-M and V Is any column unitary matrix of appropriate dimension, m*s. T 
find a solution Y we must solve the following linear equation 

K 1/2 Y - VE 1/2 U* - K +/2 L . as 

This equation is consistent if and only if V is contained in the range space of K. Since V Is column unitary :h 

range space of V may be any m dimensional space with rank *, solutions of (28) exslst If and only 1 

rank(L K*L-M) - s < t * rank(K). Given that equation (28) is consistent then Y Is a solution If and only if l 

has the following form 

Y - K +/2 (VE 1/2 U* - K +/2 L) ♦ (I - K +/2 K l/2 )Y 


where Y Is an arbitrary ra*n matrix. 


The results of this theorem show that the matrix L K L-M Is the key to solutions of the quadratic oaert 
equation (21). Substituting for K, L, M from equations (24), and using the rules for the Moore-Penrose Inverse o 
a matrix product (1211), we find that 


L K L - M 


qq 


- V 


T 1 ' 2 


(l 


(A 1/2 (t - F + F))( A l/2 (I - 


F + F)) 


)A l/2 F*d 


<19 


From this equation we find an interesting result on the Moore-Penrose inverse of a quadratic form which we stat 
without proof. 


Fact : The Moore-Penrose Inverse of the quadratic form F^F Is 

(FAF*) + • F* T" 1/2 (I - (T 1/2 (t - F'F))(T 1/2 (l - F + F)) + )T 1/2 F' f 


(30 


where T is a positive definite matrix and F Is any matrix which is tnul tipi Icat ion compatible. 


Using this result and equation (21) in (29) we find that 


L K L - M - d 


qq 


q q 




which Is guaranteed to he nonneqatlve definite ((22)). Thus the first part of the constraint (26) of 
will always be satisfied. 


theorem 


To show that the second part of the constraint (26) will 
desert pt 1 on ( l 4 ) , 


also be satisfied we note that from the range s^ae 
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(32) 


r - rank(D^) * rank(D q ) 

and from the definition of K (24), the relations (21) and the column dimension of F, 

t ^ rank(K) - r«nk(I - F*F) • (n +l)r - r«nk(D„ .) . (33) 

U q- 1 

Rohde (23] has shown that the partitioning (20) of the nonnegative definite' matrix 7 implies 

R 

rank(7 ) • rank(D .) ♦ rank(d - d^D* ,d ) (34) 

q q-1 qq q q-I q 

and by using (31) 

• * r«nk(L*K + L - M) - r.nk(D ) - r«nk(D„ .) . 05) 

q q-1 

Collecting equations (32) ,(33) and (34) we find that a < t and therefore the second part of the constraint (26) 
In theorem 2 will always be satisfied * 

We have shown that solutions of (23), (24) always exist and by theorem 2 they will have the form 

Y - K +/2 (vr 1/2 U* - K +/2 L) + (I - K +/2 K 1/2 )Y 06) 

where U£U* la the full rank singular value decomposition of L*K*L-M, V is any column unitary matrix whose range 
apace la contained In the range space of K and Y la arbitrary. We observe that the second term of (36) la in the 

null space of K which Is also the range space of F* . It follows that when (36) la substituted Into the 
expression for G* (22) that this term will be annihilated by (l-F*F) which represents a projection onto the null 
space of F along the range space of F* , The first term of (36) is In the range space of F, or the null space of 
F, so that under the projection (I-F*F) It remains unchanged. Therefore C* becomes 

c* - T^f 4 ! ♦ k 4 ’ /2 ( vr 1 /2 u* - k* / 2 l)) 

or by using equation (24) and conjugate transposing 

* 

G - 3V T l ( I - (I - F*P)( ( I - F^Fjr'd - r*F))*T i ) * (37) 

<1 

- d*D* ,J ) 1/2 UV*((I - F^Tf'd - F*F)) + /2 T 1 . 

qq q q-l q 

Equation (37) is an explicit expression for G which was the objective of this section. All of the freedom In C 
Is contained In the column unitary matrix V whose range space Is constrained to be In the null space of F. 

6 . Applic at ion to a Robot Mani pulator 

Consider the two degree of freedom manipulator Illustrated In Figure 1. The arm has Its center of mass at 
point C, and it may be translated through or rotated about the fixed point 0 by the force F and torque T, 
respectively. The manipulator carries a load at the point L* 



A 

Figure l. Two Degree of Freedom Manipulator 

Treating the load as a point mass and allowing for Joint stiffness and damping, the equations of motion are 
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(a ♦ M)r * + ♦ k f r - (a ♦ M)rS 2 - fUJ 2 - t 

(J ♦ I • fU 2 ♦ 2M Mr * (a ♦ N)r 2 )9* ♦ ♦ k,9 ♦ 2(a ♦ M)rr« ♦ 2H*r9 • T 

where a la tha distance fro# C to L, H la cha mass of cha load, J la the moment of inertia of Cha Joint, m la the 
mass of tha arm and 1 la lta aoaant of Inartla about C. Joint atlffnaaa and damping ara rapraaantad by k , ^ 
b r » b e» respectively. Introducing tha atata vector and tha control 

i • ( r r e l ) T , u • [ n | T 

than tha equations of notion have tha generic fora 

t • f(«) ♦ g<*)u . 

f< •) and g( .) into a 
X ((*M6J). Using tba 


- 100 kg, K - 50 kg, J 
shows tha atap raaponaa 

of tha nonlinaar aquationa of notion and tha full ordar blllnaar nodal, Tha blllnaar nodal provldaa a fair 
approx lnat ton to tha trua nonlinaar ayataa. A nor a accurata approximation could ba na^a by ratalnlng higher 
ordar tame In tha power aeries expansions. 


A blllnaar nodal of tha manipulator can ba conatructad by expanding each of tha functions 
power series and Introducing a new atata vactor which contains higher ordar tarns in 
first three tarns of tha Taylor series expansions of f(,) and g(«) and letting 

x • ( r r 9 9 r 2 rr 9r 9r r 2 )r Or 0 2 09 9 2 r* 9 3 J T 
than wa have a 34 ch ordar bilinear nodal and after discretisation It has tha fora (i). 

For purposes of illustration, tha following numerical values ara chosen: a • 1 m, a 
- 1 • 100 kg-m , and k f • 6 N/n, kg • 2.5 N/n, b f • 3 N-sac/m, bg • 5 N-sec/a. Figure 2 
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Figure J. Step Response ot Nonlinear and Full Order 3i linear Models 

Applying the mode) reduction algorithm with q*3 (matching three sets of Vol terra and covariance parameter*) 
a class of 3-*Volterra CoVEKs was obtained. These reduced models have 14 states which la a greater than fifty 
percent reduction In model order. Figure 3 shows the response of a reduced model from the class of 3-Vol terra 
COVERS and the response of the full order model to a unit pulse input with a 3 second duration. Figure 4 'bows 
the response of thes models driven by a unit Intensity Gaussian white noise process. In Figure 3 we s that the 
response of the full and reduced order bilinear models are nearly Identical tor the first 10 seconds. Similarly, 
in Figure 4 the reduced order model mimics the full order model initially. These observations are in accordance 
with the theory which states chat the response of the reduced order model equals that of the full order system 
for q steps la time. However, in both esses the quality of the response of the reduced order model deteriorates 
with time and It eventually goes unstable. This Instability Is input dependent and possibly In a closed loop 
setting the model behavior would be acceptable for greater periods In time. 
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Figure 4. Stochastic Response of Full and Reduced Order Bilinear Models 


7. Conclusions 


A sequence of sets of Volterra parameters characterizes the deteral nlst ic bilinear system, and a sequence of 
sets of covariance parameters describes the stochastic bilinear system. A model reduction technique was developed 
for discrete bilinear systems which generates a class of reduced order models which exactly match the first q 
sets of Volterra and covariance parameters of the full order model . These models are therefore called q-Vol terra 
covariance equivalent rea 1 Ixat Ions , or q-Vol terra COVER*. Methods to choose specific models from within the 
class to satisfy additional modelling considerations Is a topic of future research. 
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Abstract 


Asymptotically stable linear time invariant systems are capable of tracking arbitrary reference signals with a bounded error 
proportional to the magnitude of the reference signal (and its derivatives). U is shown that a similar property holds for a general 
class of nonlinear dynamical systems which includes all robots. As in the linear case, the error bound may be made ^arbitrarily* 
small by increasing the magnitude of the feedback gains which stabilise the system. ^ 


1 Introduction 

Tracking is the archetypal pursuit of the control theorist. Given a dynamical system, 

* =/(*.«). 

v = M*) 

and a specified “reference signal”, r(t), it is required to find a control, u # (t) such that the forced system, i = /(x,iT) “tracks" r 
in some sense — usually lim*-,*, y = r. Solutions to such problems generally involve pre-filtering the reference trajectory through 
a suitable “feedforward" algorithm, and then adding a compensating error driven “feedback" term to arrive at the input, u'. If 
the reference signal is known i priori, then the feedforward algorithm may entail pure differentiation to “pre-compensate" for 
the lags introduced by the dynamical system itself. However, on-line differentiation of unknown and unpredictable signals has 
long been eschewed by control theorists as an unreliable technique for both theoretical as well as practical reasons. 

This paper considers the problem of tracking in the context of telerobotic manipulators. It is shown that a general class 
of highly nonlinear control systems which includes all robot models admits tracking algorithms based upon high gain linear 
state variable feedback. The choice of a pure feedback based algorithm for tracking is surely not optimal in any sense of the 
word. However, the only other techniques which are known to guarantee tracking for this class of systems make use of feedback 
algorithms which attempt exact cancellation [1,2,3}, (or “nearly" exact cancellation, e g. (4,5) ) of intrinsic nonlinear dynamical 
terms via feedback, and pure differentiation of the reference trajectory in the feedforward path. In robot applications admitting 
the use of a “high level" planner it is plausible that the entire future strategy might be made available at once to the “low 
level" controller in which case tracking scheme? requiring pure differentiation of the reference signal might be acceptable. In 
telerobotic applications the reference signal is, by definition, a priori unknown: it is generated as a record of the unpredicted 
arbitrary motion of a human agent of control. Schemes which require pure differentiation will probably not be useful in this 
context. 

In a sense, the result reported here simply represents another example of the similarity between general mechanical systems 
and second order linear systems. It is well known that asymptotically stable linear time invariant systems are capable of tracking 
arbitrary reference signals with a bounded error proportional to the magnitude of the reference signal (and its derivatives). For 
• fixed bound on this magnitude, the asymptotic tracking error may be made “arbitrarily" small by increasing the magnitude of 
the eigenvalues in the left half of the complex plane. In practice, this is accomplished by increasing the gain of linear feedback 
compensators. In this paper it is shown that the analogous property holds true for the more general class of nonlinear mechanical 
systems. 

As in the theory of linear servomechanisms, a practical obstacle to the systematic use of high gain feedback techniques in 
telerobotic applications is the inevitable presence of actuator torque limitations. Practical tracking strategies which address this 
problem while maintaining convergence guarantees are very much needed. This important consideration is entirely ignored here. 
The problem of characterizing the transient response of feedback compensated nonlinear mechanical systems is the topic of a 
paper currently in progress. 

*Thi» work i a mp ported in part by the National Science Foundation under gr™* #o. DMC-S5051O0 
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2 Preliminary Discussion 

2.1 Notation and Definitions 

If f : n," _* n* haul continuous first partial derivatives, denote its m x n Jacobian matrix as DJ . When we require only a subset 
of derivatives, e.g. when x 
we may write 


, and we desire the jacobian of / with respect to the variables Ji € R*\ « » held fixed, 


0 J* 


0 ../ =Df 

If A: J R"** is • smooth map taking matrix values then let 

pi A) = sup sup |x T dx| 
M-t 


and 


i/(A) = inf inf 


x T dx|. 


If J is compact, or the entries of A are bounded then both v(A), p(A) are non-negative real numbers. For any constant matrix, 
n{A) is the square root of the eigenvalue of greatest magnitude, while i/(d) is the square root of the eigenvalue of least magnitude 
of A t A % from which it follows that 

p{A) = *up ||<4(f)|| IMA) = sup||A- , (?)ll. 


where || * || denotes the operator norm induced by the eulcidean norm of R*\ 

Given a set P t a smooth scalar valued map, v : P -♦ R is said to be positive definite at o point p€ P if v(p) = 0, and v > 0 
in some open neighborhood of p. Given » smooth (time invariant) vector field,/, on some phase, space, P, we shall say that, v, 
a positive definite map at p d € P, constitutes a Lyapunov function for f at p t if the time derivative along any motion of the 
vector field is non-positive, 

0 = D p v f{p) < 0, 


in some neighborhood of pj, and that it constitutes a strict Lyapunov function for f if the inequality is strict (6,7|. The domain 
of v with respect to p 4 is the largest neighborhood around p which is free of additional critical points and upon which the 
derivative is still non-positive. 

The existence of a strict Lyapunov function at a point is a sufficient condition for asymptotic stability of that equilibrium 
state. If a strict Lyapunov function has not been found, asymptotic stability may, nevertheless, be assured if a further condition 
on the possible limiting set holds. This is “LaSalle's Invariance Principle” (7|. It is possible, as well, to draw conclusions about 
the tracking capability of a forced dynamical system in consequence of of the stability properties of the unforced vector field at 
a particular equilibrium state. However, this seems to require the use of a strict Lyapunov function. 

It has been known for quite some time that the total energy of a mechanical system may be interpreted as a Lyapunov 
function (8|. Unfortunately, this choice of Lyapunov function is never strict. The central contribution of this paper rests upon 
the construction of a strict Lyapunov function for the general class of nonlinear mechanical systems described below, (1). The 
tracking results follow as a standard consequence. 


2.2 Dynamical Equations of Kinematic Chains 

The equations of motion of a kinematic chain have been extensively discussed in the robotics literature, and this paper will rely 
upon the standard rigid body model of an open chain with revolute joints. Thus, wc consider a robot to be a particular member 
of the class of mechanical systems^ 

+ B\q, q]q + k(q) = r (l) 

where the generalized positions take values in a configuration space, q 6 J y and M is a positive definite invertible symmetric 
matrix for all q € J. As shown in the appendix, in the case of kinematic chains, A/, the “inertial* terms, B, the “coriolis and 
centrifugal” terms, and k % the gravitational disturbance vector, all vary in q by polynomials of transcendental functions. It 
follows that i/(M) > 0 and p(M) < oo. 

This system may be rewritten in the form 

<Ii = <h 

<h = + k - r| 


(2) 


where the generalized positions and velocities take values p = 


€ P = TJ in phase space — the tangent bundle over J. 
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While M t k art always bounded, the coriolis and centripetal forces are quadratic in the velocity — i.e. B is linear in 4 — 
and, therefore, may become unbounded. It is, however, bounded with respect to q 9 as the following technical result shows. 


Lemma 1 For any k € R", 


is founded abovt. 
Proof: 


* T *ft . ft T M(k)ft 

r i . 

*D'„M J "'■* 


Bft = (ft « I) 7 D'M* ft - j [(ft ® I)'D % hf ] T ft 

and, hence, 

= fll [D,M* ] T (ft ® /)* - »ft T (ft ® l) T W k 
= «I [B.M* ] T - »,J (. 5 /) T D'M* k 
(\ k'D,„M ] \ 

= cl i ft. 

U J ) 

Since M contains transcendental functions in q , all of its derivatives in q must be bounded. 


It follows that for some £ < oo, 


«*(*)« < £11*11* 


Corollary 3 For oil p€ ?, 


Proof: 


rfl \m - B|ft H 0. 


= \q] f (ft ® l) - (ft ® /) T D,M* j ft 


2.3 Stability Properties of “PD” Compensated Systems 

Suppose we are presented with the mechanical system (2), and a desired point, 


* a [o| € '- 

Choose two positive definite matrices, K\ % K\ > 0, and define the “PD" algorithm 

r = *( 9 ) - K x \q4 ~ q\- 

In terms of the translated “error coordinate system" for P , 


the resulting closed loop system has the form 


-M'% - \r'(B + Kt ) 


= A[q,q\e. 


357 


Proposition 3 For all ~fo >0, 


is a Lyapunov function for Me closed loop system (6), 


0 

ItoAffa) 


Proof: It U clear that 0 is positive definite at the origin of the error system. Taking the time derivatives along the 

solutions of the closed loop system, (6), 


v * J« T [P>1 + A'P + P\e 

a "* T [S + 


Noting that e t = ftt it follows from Corollary 2 , that the second term is identically zero. 

□ 

There follows the desirable result that proportional and derivative linear state feedback stabilises a mechanical system, after 
the gravitational disturbance torques have been removed. 

Theorem 1 ( [0,10,11] ) The origin of the closed loop error coordinate system (6) is asymptotically stable . 

Proof: The existence of a Lyapunov Function, 0, assures stability. According to LaSalle’s invariance principle, the 

attracting set is the largest invariant set contained in {(ei,C|) € P : v s 0}, which, evidently, is the origin, since the vector 
field is oriented away from {«j = 0} everywhere else on that hyperplane. 

□ 

Notice that the proof of attractivity requires an appeal to LaSalle’s invariance principle in consequence of the fact that 0 
is not a strict Lyapunov function. In order to obtain the desired extension to tracking problems it is necessary to construct 
one. Unfortunately, the constructions devised to date require the artificial limitation to decoupled PD feedback. Namely, in the 
sequel, it will be assumed that the gain matrices of (6) are specified as 


K x £ u/’/; 


= 2 qul 


given two positive real numbers, w, f. 


2.4 A Strict Lyapunov Function for Nonlinear Mechanical Systems 

The following technical lemma will be of use in the main result, below. 

Lemma 4 For M(q) as in (l) and any positive scalars t a, 0,7 € R*, 


M=> ^ [ 01 lM(q) ] * " 


In particular , the matrix is positive definite when 


K a a 0 

0 7»(M) 


a 7 i/(Af) > 0 1 


Proof: 


a/ 01 


al 01 


it will suffice to show that 


01 7M{q) \ [01 iv(\f) 


= tf®/, 


This follows since all eigenvalues of K 8 / are eigenvalues of K, according to Lemma 12 in the apper.au. The particula 
conclusion obtains by taking the determinant of K. 


Proposition 5 For all p d € P and > 0, given any bounded set, 8 C P, containing p d there exists a scalar 7o > 0 such that 

u(e) £ -e*P(q)c = - e 1 [ ^ 1 e 

W 2 1,1 2 l u/f / 7 0 M{q) J 

is a strict Lyapunov Function for the dosed loop system, (6) on the domain 3, assuming the decoupled feedback gain matrices 
specified in (7). 
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Proof: 


Letting 


0 = »up||«ll. 

•cl 


find some 70 aatiafying 


I l i \ 

7#> ^{^WW + T 


w 


According to Lemma 4 and the inequality involving the first entry of the inferior set in (9), it follows that P is a positive 
definite matrix for all q € * 7 , hence v , is positive defnite at pi. 

Taking time derivatives along the solutions of iystem ( 6 ), we have 

v = \t J [PA + A t P + P]t f 


which may be expanded as 


T [ wAf 1 1 

W = " Wfe 7 „/ j« 

-wf ("To - 


+ 7o «I(}iVf - B)e,. 

The term in the last line vanishes according to Corollary 2 . Moreover, the block matrix in the first line is positive definite 
according to the inequality (9) and the result of Lemma 4 since 

[ w*Af“ l u/Af- 1 ] [ Afi 0 ] [ w 1 / ul 1 f AH 0 

[ 70 / J = [ 0 AT* J [ w/ 7oAf J l 0 AT* 

Finally, according to Lemma 1 , the term in the middle may be rewritten as 

wf((7o - = wfeJ((7o - 1)/ + ASf(Af > 0, 

where k = Af -I ci t and the result follows from the inequality involving the last entry of the set in (9). 

D 


3 Consequences for Tracking Unknown Reference Signals 

Now consider the decoupled “PD" compensated system forced by a continuously differentiable reference signal, ft (O’ 

r = k(q) - (**[«,(() - ?] - 2 u(q. (10) 

Assume that the reference trajectory is “unpredictable* — i.e. its first and second derivatives are unknown — but there is 
available an i priori bound on the maximum rate of change, 

M < flo- 

Notice that the forced closed loop system may be written in the same error coordinates as (5), above, 

«= ^|i,9]e + <*. (11) 

where d = [ J, is a ‘‘disturbance" input due to the unknown but non-zero reference derivative. 

Theorem 2 The closed loop ‘disturbed 9 error system (11) has bounded trajectories which asymptotically approach the set 

{« € P : ||e|| < ^2 aM)' + 1 /« i 


where 


Proof: 


K * w 1 

1 7 0 i/(Af)/w 


We have 


v = j* T (PA + A X P + P\e + e T Pd > 


5 + 


< [\\'MK) ~ + l/w’j . 

This is negative whenever e is outside the set indicated in the statement of the theorem. 
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□ 

Corollary 0 The asymptotic tracking bound may be made arbitrarily email by increasing the magnitudes of the feedback gasses 
in (10) 

Proof: For a sufficiently large value of ut it is possible to choose two real numbers *|,#C| € (0, 1) such that 

C = ; 7o = 

and the inequality (9) still holds. Using these definitions and the results of the theorem, the attracting region is bounded 
by the magnitude 

Note that 

u(K) * w + 7 o*'(A/)/w - yj{u ^Tfo^AQM 1 + 4 

= w + 4- 4, 

hence, 

MX) _ x v- *w{M) : Q 

^ ^(w — #C|i/(A/))* + 4 

and i/(/f) is bounded from below as w increases. Since may be made as small as desired without violating (9), the result 
follows. 

□ 


A The Stack Representation 

If A € R**"\ the “stack” representation of A 6 R”"* formed by stacking each column below the previous will be denoted A 9 

( 12 ). 

If B € R’ M , and A is as above then the kroneckcr product of A and B is 

<*\\B ... a\ m B 
anB ... a im B 




€ R"'** 1 *. 


a^B ... a nm B 

The kronecker product is not, in general, commutative. Note that while the transpose “distributes" over kroneckcr products, 
the stack operator, in general, does not. 

Lemma 7 If A 6 R n * m then there exists a nonsingular linear transformation of R nm , T t such that 

(a 1 )* = T A* 

Proof: For p = nm, let B = {b x , denote the canonical basis of R p — i.e., 6, is a column of p entries with a single 

’ entry, 1, in position i, and the other *» - 1 entries set equal to zero. The transpose operator is a reordering of the canonical 
basis elements, hence may be represented by the elementary matrix, 

T ~ jhj, 6n> 1 , 1 * •••» l t bj, h n +2t •••» ^Int l*3«» • 


□ 

For n = m, if we define P+ = / + T t P- = 1 - T then both operators are projections onto the set of “skew-symmetric* , 
“symmetric" operators of R", repsectively, since P* = P ± . Note that Ker P± = Im P*. 

The kronecker product does “distribute” over ordinary matrix multiplication in the appropriate fashion. 
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lemma * If Ae R ""•»,*€ R' K \Ce K— *,Z> 6 H»“' thrn 

• B)(C 9D)m (AC « BD). 

Lemma 9 ([13] ) //Be R""», 4 € R" K “, end C € R’"« Men 

|i4flC]' - (C T 0 

Noting that for any column, c 6 R , “ , t we have 

«* - [(«)*] * - «. 

there follows the corollary 
Corollary 10 If Be R""», c € R» then 

Be ■ Re* ■» (e T 0 /)B* 

, — (l® e | T )* " (* T ® T )* “ U ® * T ) (® T )* * 

Noting, moreover, that 

tr 

there followe the additional result 
Corollary 11 If A e R nnm ,B 6 R'"» then 

tr {dfl T } a (V) T B*. 

Proof: 

= (/*) ( B9I)A • 

= (it , ) T (fl T ®/)/* 

= (V) T /f. 


Lemma 12 For any square array , A 6 R"**, if f m i$ the identity on K m then the spectrum of (A ® I m ) is contained in the 
spectrum of A. 

Proof: Suppose A is an eigenvalue of ( A ® I m ). There must be some non-zero vector, x 6 R m *' in the kernel of 

A (I m ® /«) “ (A ® /) Since i = X 9 (= R"*"\ it follows that 

0 [A(/„ ® I m ) - (A ® I)\x 

= [AX - Xyt T j* 

= [X(A/.-yl T )]'. 

This implies that Im X r C Ker A f H — A % and since the former subspace has dimension at feast 1 (according to the 
assumption that X 0), the latter must as well. Thus, A is an eigenvalue of ^4. 
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B General Robot Arm Dynamics 

The rigid body model of robot arm dynamics may be most quickly derived by appeal to the lagrangian formulation of Newton's 
Equations. If a scalar function, termed a lagrangian, A * k — v, is defined as the difference between total kinetic energy, x, and 
total potential energy, v, in a system, then the equations of motion obtain from 

±D 4 \ -D'\ = r T , 

where r is a vector of external torques and forces [13,14]. 

First consider the kinetic energy contributed by a small volume of mass 6rm at position p in link £<,. 

s*i « 

where °pi = *p is the matrix representation of the position p in the base frame of reference, 0 /} is the matrix representation of 
the frame of reference of link Ci in the base frame, and { p is the matrix representation of the point in the link frame of reference, 
and, hence, 1 

Pi = A *P . 

since the position in the body is independent of the generalised coordinates. The total kinetic energy contributed by this link 
may now be written 

* = Icll^p]' F^pdm, 

■ ft, i tratt{Fi *p [A ‘pjVmi, 

= } tracc{Fi It,/? 1 ** [A]*} 

= 1 trace {FiTiF?}, 

(jince the frame matrix is constant over the integration), where is a symmetric matrix of dynamical parameters for the link. 
Explicitly, if the link has mass, m, center of gravity (in the local link coordinate system) ft, and inertia matrix, 7i, then 

■h o f Ti Mi ft 

' l m r Wx J ’ 

Passing to the ttatk rtpresentation (refer to Appendix A) 

2k, = trace {Fj\F?} 

= [(**)*]> 

-[(*®/) T *?] T Af 

= [F^Pif? 

= [{D'F?)q] r MD,F*)q 

= 7 T Mw, 

where we have implicitly defined 

M,( q ) & [D, F* ] T HD,# ; A = Pi ® /. 

It follows that the total kinetic energy of the entire chain is given as 

* = X7 T Af(7)7; M(q) = 

»* l 

The potential energy contributed by 6m, in C% is 

Svi = aj Ft ‘ pgSm , 

where g is the acceleration of gravity, hence the potential energy contributed by the entire link ia 

J 'pgdm, = zlF.pig, 

and v = £" =1 zjF.p-g. 1 

To proceed with the computation, note that D,X = D,k = q T M[q), hence, 

=q T M{q) +q T M{q). 

1 We will omit the prior superscript, 0, when it is clear the the coordinate system of reference is the base 
^Assume that zq ‘points up* in a direction opposing the gravitational field. 
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Moreover, 

d,k -U»D,[WUM 1 
- IWvlftM*. 

hence, if all termi from Lagrange’s equation involving the generalised velocity are collected, we may express them in the form 
« T B T , where 

B(«.4) T = Mto-5[4 T «W*'. 

Finally, by defining k(q) = [Z7 # v j T , Lagrange’s equation may be written in the form (1) 

M{q)q + *M)i + *(f) = r. 

M , called the "inertia* matrix, may be shown to be positive definite over the entire workspace as well as bounded from above since 
it contains only polynomials involving transcendental functions of q. B contains terms arising from "coriolis* and "centripetal" 
forces, hence is linear in q (these forces are quadratic in the generalised velocities), and bounded in q, since it involves only 
polynomials of transcendental functions in the generalised position* Finally, k arises from gravitational forces, is bounded, and 
may be observed to have much simpler structure (still polynomial in transcendental terms involving q) than the other expressions. 
An important study of the form of these terms was conducted by Bejcsy [15]. 
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1* Abstract 


,pipn wamnrr-U 


AMETEK/OREO inhoua 


development efforts leading toward a ^next-generation**, 
robotic manipulator arm and end-effector techno logy/ 
Manipulator arm development has been directed toward a 
mul tiple-degree-of-f reedom, flexible, tendon-driven con- 
cept which we refer to as a Tendon Arm Manipulator (TAM). 
End-effector development has been directed toward a 
three-fingered, dextrous, tendon-driven, anthropomorphic 
configuration which to as an Anthropomorphic 

Robotic Hand (ARH). Key\ technology issues are identified 
for both concepts. \ / 

\ ,s a y J*-*~*^*f 

2. Introduction 


£ O /V) 


/? rt 1 *2- 




The background, rationale, and requirements for a next-generation manipulator arm and 
end-effector are noted in order establish the foundational assumptions upon which the inhouse 
RfcD program is based. In order to relate the context for development, this background 
includes a brief synopsis of the projected telerobotics evolutionary path which AMETEK/OREO 
has advocated since its inception in 1977. 

Over the past five years, AMETEK/ORED has pursued concept development of a Tendon Arm 
Manipulator (TAM) through a low- level -of-ef fort inhouse RfcD program. This development has 
included three conceptual design configurations and two limited engineering development 
models. Results of this program to date are summarized. The latest TAM design configuration 
is illustrated and discussed, including performance design goals. Technical issues and 
enabling technology development are noted. 


The original RiD for the TAM included some preliminary work on a dextrous three-fingered 
end-effector concept. About two years ago, in response to a planned NASA program, this work 
was formulated into a concept design for an Anthropomorphic Robotic Hand (ARH) ✓ The concept 
was further refined and some preliminary design performed in response to the proposed DARPA 
Advanced Robotic Manipulator program. The baseline ARH concept design is illustrated and 
described. Technology issues and key enabling technology development are summarized. 


3. Background 

The first robotic manipulator arm and end-effector was adapted to a subsea remotely 
operated vehicle (ROV) in 1961. Over the succeeding 25 years, increasingly capable 
manipulators have been designed and applied to subsea R0V*s; in general, control of these 
manipulators has been limited to master-slave teleoperation, but has included bilateral force 
feedback on the more sophisticated systems. AMETEK has been involved in this applications 
arena for many years. 

In 1979, AMETEK/ORED initiated an inhouse study program to forecast next-generation 
manipulator technology. 

Technology advancement of articulated (reval ute-coordinate) manipulator arms appeared to 
be well covered, but we identified operations in unstructured subsea environments, e.g., 
around wellheads, where articulated arms were severely constrained in accessibility. In 
these cases, what was needed was a flexible "snake- 1 i ke" multiple deg cee-of -f reedom (DOF) 
configuration to work through and around a maze of obstructions. This need was not being 
addressed, and thus became a goal for further inhouse work. For end-effectors, other than 
specialized, task-specific end-effectors and tools, there appeared to be a driving need for a 




general, dextrous end-effector with kinesthetic and haptic capabilities approaching that of 
the human operator. We elected to parallel research on dextrous end-effectors along with our 
manipulator arm research. Progress to date on both the manipulator arm and end-effector is 
summarized in Section 4. 

In order to establish a context for this program, it is useful to briefly note the 
differences in orientation and approach between robotics technology development directed 
toward teleoperation and that aimed for autonomous applications. The issues, particularly 
with respect to control, are significantly different. The most notable difference is in the 
nature of the pacing robotics technologies: for autonomous operation, higher levels of 
control 11) are the pacing item, and mechanical systems with dextrous capabilities cannot be 
fully utilized as yet; under teleoperation, the operator provides higher level control, so 
h i gh 1 y-capab 1 e mechanical systems can more readily be utilized. Hence the motivation to 
prioritize such advanced mechanical systems development is greater for teleoperation. 

AM ETEK/OR ED advocates a view of the evolution of robotics technologies from 
te l eopera t ion toward f u 1 1 y-au t onomous systems, through progressive implementation of 
supervised autonomous modes of operation, as sensing, control, and computational technologies 
mature. An informative technical paper on this subject was written by J. Vertut, Manager of 
the Advanced Teleoperation Program in French Advanced Robotics and Automation project (2). 
Our views were expressed by AMETEK/ORED General Manager Jack Stone in his article in ROV 
Magazine 13). A more exhaustive treatment, including specific examples for space 
telerobotics, was provided recently by NASA/Montemer 1 o 14). 

4. Concept Development: TAM and ARH 

AMETEK/ORED has separated the inhouse IRiD program into two related concept development 
initiatives, the Tendon Arm Manipulator (TAM) and the Anthropomorphic Robotic Hand (ARH). 
The TAM concept is discussed first, followed by a discussion of the ARH concept. 

Tendon Arm Manipulator (TAM): 

Inspiration for the TAM concept originated with Tensor Arm Manipulator Design (Figure 
la) by the Scripps Institution of Oceanography (5). We also examined with interest the Spine 
manipulator arm (Figure lb) developed by Spine Robotics (61. 




Figure 1, Flexible Manipulator Arm Configurations 

Both of these configurations utilize a number of jointed discs, the planes of which can be 
rotated in two dimensions with Lospert to one mother. Each disc is driven by four tendons, 
two for each degree of freedom. Thus the arm has a maximum of 2(n-l) OOF, where n represents 
the number of discs (the first disc is fixed to the base or world frame, while the last disc 
serves as the base plate for the wrist). The number of independent OOF can be reduced, as 
desired, by establishing an angular relationship between disc rotations, e.g., the two 
sections of the Spine arm form only circular arcs of varying radius, so that the arm has only 
four independent DOF. 

The original TAM design con f igur a t i on (Figure 2 ) resembled the Scripps design because of 
its adaptability to multiple DOF and more arbitrary shapes. Four joint configurations were 
considered, varying the relative placement and connection of the joint with respect to the 
discs. A simple engineering model was built in order to duplicate some of the results of the 
Scripps work. As a follow-on, a larger engineering model wis constructed, specifically to 
emperically examine lojding of tendons and joints and instabilities. 
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Figure 2, AMETBK/ORED TAM, Original Design Concept 

Two major deficiencies of the basic Scripps configuration were confirmed: (1) a buckling 
instability (also noted by Scripps) between discs; and (2) high torsional loading of the 
joints under certain loading conditions* 

The current TAM design, illustrated in Figure 3, draws on the latest advances in 
"Serpentine Arm" technology, summarized in a recent Intelligent Task Automation report (7J, 
and addresses and corrects the deficiencies exhibited by the TAM engineering models. 



Figure 3, AMETEK/OKEU Tendon Arm Man.pulator (TAM) 

Baseline Concept Uesign 

In order to eliminate the buckling instability, sheathed cables are used for the 
tendons, each sheath terminating at the di3c preceding that being displaced by the tendon; 
this makes each displacement deter mi nate and precludes the buckling exhibited by *:he previous 
TAM models. To reduce the high stresses generated by torsion, the joints were reconfigured 
to form large-diameter doub 1 e-g i mba 1 1 ed rings; this not only increases the effective radius 
for reacting torsional moments but also provides a convenient center-arm space for touting of 
actuation cables. 

Performance goal 3 for the baseline TAM design include the following: 

• Length: 36" f rein shoulder to wrist base plates. 

• Weight: 20 lbs incl structure and tendons. 

% Payload: 50 lbs exci wrist and end-effector. 

• Speed: 180 degrees/sec, l/4-load (all joints). 

% Accuracy: 0.050" or better. 

% Operational Envelope: appro x i ma te l y hemispherical. 
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The current TAM baseline design, with each joint limited to 30 degrees angular 
deflection (as our research has indicated is a practical deflection upper Limit for a tendon- 
driven configuration), requires nine segments to achieve a hemiapher ica 1 operational envelope 
-- actually somewhat more than hemispherical as shown in Pigure 3, closely corresponding to 
an optimal operational envelope for an articulated manipulator arm. 

Obviously, given the current state of the art, control of such an arm, with up to 18 OOP 
for the baseline design, is a major issue. We have generated control scenarios, however, to 
account for this limitation: 

Por telooperat ion , each joint can be servo-controlled with a scaled replica master, with 
which the operator Shapes" the spacia 1 ly-cor respondent TAM. If, after positioning the arm 
at a work site the operator subsequently displaces the master arm such that the TAM contacts 
some obstruction, the bilateral control system compliantly reshapes the TAM around the 
obstruction and simultaneously conforms the master to the new shape. This represents a 
simple extrapolation of current technology. 

Por autonomous operation , the TAM can be limited in independent OOP by controlling groups of 
discs in a relational manner, as with the Spine arm. Such groufTng may be accomplished 
mechanically or electronically. Initially, as few as four independent OOP may be used 
(determinate), with increasing DOP and shape capabilities implemented as sensing, control, 
and computational technology advances. Ultimately, with the control loop closed around the 
end-point through sophisticated sensing and control, and with control strategies for 
i ndetermi nate arm configurations (e.g., world modeling with sensing updates and spacial 
distribution of allowable arm shapes and trajectories within the world model), the TAM should 
be able to achieve accuracies and capabilities rivaling articulated arms. 

Anthropomorphic Robotic Hand (ARH ) ; 


Much relevant work has been done over the past thirty years on dextrous end-ef rectors. 
Por the first twenty years, this work was almost exclusively in the aiea of prosthetic 
devices. An interesting example is the Belgrade hand (8). Over the past ten years or so, 
there has been considerable in^rest and effort directed toward dextrous end -e f f ecto r s 
suitable for robotic (autonomous) or mixed-mode (teleoperation/autonomous) applications. 
**Te 1 eoper a t ion", in this case, includes close-coupled prosthetic app 1 ica t i ons. The 
previously-noted report for the Intelligent Task Automation program [7| Includes a 
comprehensive summary of dextrous end-effectors. 

AMETEK/OR ED's initial work on a dextrous end-effector concept for the TAM focussed on 
the Multiple Prehension Manipulator System (MPMS) |9J design circa 1974. Thia hand, 
illustrated in Figure 4a, has three fingers, each with base rotation and link curl (total of 
six independent DOF). It is able to simulate all six prehensile modes of the human (as 
defined in the referenced article), but is not anthropomorphic. 



c> Salisbury End-EMactor 


a) Caporaii/Shartinpoor End-EMactor 


Figure 4, Dextrous End-Effector Designs 







Using the prehensile modes analysis, along with an analysis of the configurations of 
existing dextrous end-effector designs, notably the Jacobsen [10], Salisbury [11], and 
Capora 1 i/Shahinpoor [12] hands (illustrated in Figure 4), we derived a unique design, 
specifically directed toward anthropomorphicl ty and simplicity. Because of our orientation 
toward teleoperation, we gave anthropomorphicl ty a high priority. AMETEK/ORED designated 
this design concept the Anthropomorphic Robotic Hand (ARH). 

The baseline ARH design concept, as illustrated in Figure 5, utilises three fingers 
(configured as a thumb and two fingers) and a fixed palm. In order to directly mimic the 
grasping modes of the human hand, of particlar advantage for teleoperation, the thumb has the 
capability to rotate from opposition with the fingers to planarity with the palm. In 
addition, the digit joints of the thumb independently rotate curl the thumb as does a 
human thumb. Each of the two fingers has three joints; the knuckle and middle joints have 
independent rotation, while the end joint rotation is ratioed to the rotation of the middle 
joint (approximately 2:1). No lateral rotation is provided for the knuckles of the two 
fingers, but the base rotational axes are oriented with such that the tips of the fingers 
converge during curl to meet at contact with the palm. 



Figure 5, AiiLTEK/OREU Anthropomorphic Robotic Hand (ARH) 

Base'ine Concept Design 

Thus, the ARH baseline configuration has a minimum number ot independent DOF (seven, as 
compared to nine for the Salisbury hand and sixteen for the Jacobsen hand), and is able to 
achieve all the prehensile modes of the human in a direct anthropomorphic manner. 

The ability of the thumb to rotate to the plane of the palm uniquely provides a hook 
grasping mode in an anthropomorphic manner. Spherical and cylindrical grasp and closure are 
provided with thumb opposition and coordinated curl of thumb and fingers. Direct pinch with 
both or either finger tip(s) is enabLed by proper rotation and curl of the thumb with respect 
to the curl of each or both of the fingers, and coordination of these movements will al low 
pinch transfer. Finally, lateral pinch is enabled by rotating the thumb midway and closing 
onto the finger. 

Key technology issues in the areas of actuation, sensing, and control are, in general, 
being addressed through ongoing research throughout the community. Most of the critical 
elements currently exist commercially or are near transition from the laboratory. A complete 
review of the ARM baseline design is beyond the scope of this paper, but some of the key 
technology issues for both the TAM and ARH are noted in Section 5. 


5. Technology Development 

Preliminary design and development of the TAM and ARH have included research on pacing 
technologies, actuation, sensing, and control. Key issues, results, and recommendations 
f ol low: 

Actuation for both the TAM and ARM is provided from actuator mechanisms located in the 
base through sheathed cable tendons. The actuators could be electrical motors, shape memory 
alloy (SMA), hydraulic or pneumatic mechanisms. A particularly interesting actuation 
teehnnology is that referred to as "mechanical muscle M technology. SMA appears relatively 
less attractive because of the adverse relationship of force vs response, and high 
hysteres i s . 

In general, although "cleaner", pneumatic3 seem less suitable than hydraulics for 
actuators because of working fluid compressibility, resulting in compliance ("sponginess") 
and deflection rate ("stiction") chat jctec istics which are difficult to control. New high- 
torque rare-earth DC motors offer a very competitive alternative for actuation. 
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Suitable sensing receptors for force/torque are generally available, as well as position 
sensors, although current technology advances promise significant improvements. Tactile 
sensing elements for proximity (stretching the definition of "tactile”), contact, force, 
imaging, surface and material characteristics, and slip are the subjects of much current 
research. 

Breakthroughs are in order to be really applicable and useful for the ARH, but very 
promising devices are on the horison, e.g., thin micr omachined silicon arrays with both 
normal and shear measurement at each array site on 1mm x 1mm spacing. By comparison, 
currently available commercial tactile sensors have only normal force resolution capability 
at each site, and are approximately 1/2" thick (Lord tactile sensors). 

For teleoperation, sensing must include force/torque and tactile feedback stimulation 
for the operator. Force/torque feedback is state of the art for bilateral control systems, 
but tactile feedback is another matter. By comparison with tactile sensing receptors, 
relatively little work is being done in this area. An example of what might be done is to 
adapt the so 1 en 1 od-actua ted pin-matrix technology used for Braille readers to a hand 
controller for the operator. Such a device, perhaps fitted Into a "glove" controller, could 
potentially provide the operator with simulated contact, imaging, and force tactile feedback. 
Another technique has been suggested by AMETEK/OREDt thermal simulation of contact, imaging, 
and possibly force tactile feedback using a Peletier junction array. 

Control is g very complex issue, being addressed through many research projects in the 
community. We are generally only tracking technology developments in these areas for 
relevancy to the TAM and ARH. We have, however, developed short- and long-term strategies 
for control, focussing Initially on teleoperation for the short-term, and looking ahead for 
compatibility with likely future technological approaches for telerobotics and full 
automation for the long-term. This has been noted In preceding discussion. 


6, Conclusion 

This paper has presented an overview of AM ET EK/O f f sho r e Research and Engineering 
Division inhouse technology development efforts on an advanced manipulator arm (Tendon Arm 
Manipulator) and a dextrous end-effector concept (Anthropomorphic Robotic Hand). The current 
baseline design concepts for the TAM and ARH were presented and discussed, and key enabling 
technological issues were summarized. 
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Future Research Directions 

Moderator: G.A. Bekey 
University of Southern California 

Panel Members: R. Bajcsy, University of Pennsylvania, 

J.Y.S. Luh, Clemson University, 

A. Sanderson, Camegie-Mellon University, 

G. Saridis, Rensselaer Polytechnic Institute, 

T.B. Sheridan, Massachusetts Institute of Technology, 

C. Weisbin, Oak Ridge National Laboratories 

The intent of this session was to provide views on the state of telerobotics research and 
to draw together a collection of suggested research opportunities to present to NASA. 
The panel members gave opening statements summarizing their views, the results of 
earlier sessions, and discussion periods. This was followed by a general discussion with 
the audience. The moderator concluded the session by stating a synthesized set of 
recommendations. 

Cl. A. BBKBY: Welcome to the closing session of this symposium. I suggest each of the 
panel members take about ten to fifteen minutes to tell you about the research 
directions that they see in their own field, based on their experience or on the topics 
that they heard at this symposium. Then I suggest they take a ten- year leap into the 
future to give you a more distant extrapolation about where these research directions 
might or might not lead. After everyone has spoken, we will open the floor for 
comments, questions, contradictions, arguments and additional extrapolations. I am 
sure that some of you would like us to extrapolate an additional twenty years forward 
beyond the initial ten years. 

C. WEISBIN: As most of you know, after being here for three days and hearing all of 
these talks, any attempt to summarize near- term and long- term research directions in 
fifteen minutes would be a rather formidable challenge, so I will respectfully defer and 
do something else. I propose to tell you about the research items that I think are 
interesting and not worked on very much, some things I have heard at this conference 
that I would express some cautions about, and some things that might surprise us. So, 
what I cannot do is to state all of the research that needs to be done in vision, 
multi- sensor integration, and so on. Many of these programs have been ongoing for 
some time. My comments will obviously reflect my very personal biases. In a meeting 
with parallel sessions such as this, one cannot be at every session, and therefore cannot 
always be fair to everything that was presented. So, three types of comments: 
high-priority, cautions, and surprises. 

High Priority Issues . The first one has to do with the Minnesota work which deals with 
unexpected events and self-understanding. Ws are working in this area also. I happen 
to think that this is very important and it is an area which has not received a lot of 
attention. Most of us in this room realize that systems are not necessarily going to 
work the way we expect them to every time. But if one counts the number of papers or 
articles that address the issues of coping with contingencies and understanding 
self-awareness, I think we would find they are very few. A second area has to do with 
three-dimensional world modeling and understanding the environment. We heard papers 
for example from Hans Moravac and others on characterizing the world in which the 
robot lives. There, I suspect that people may assume that it is an easy problem, 
whereas in reality it is quite difficult. A third kind of problem is the issue of 
man- machine cooperative problem solving which was alluded to in several talks. The 
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problem consists of having nun and machined) act as a composite system to try to 
accomplish a specific task while equitably sharing the workload. One of the central 
problems Is knowledge representation. When the man tells the machine: "you take over 
vision, I am going to assume the planning role,” that assumes that each of them can 
communicate effectively at the same level. I do not think that we can do that today 
very well. Those are three topics (or topical areas) within the scope of the A1 discipline 
that I regard as high priority Issues. I do not think these areas are receiving adequate 
attention, although I feel they are very important. 

Cautions . We heard a few papers which deal with the Issue of reasoning with 
uncertainty. There has been a lot of time and effort spent on how to propagate 
uncertainty. One caution there, at least' In my limited experience. Is due to the 
potential difficulty in obtaining the uncertainties in the first place. We are not just 
talking about statistical uncertainties In data measurements. There are also 
uncertainties in rules and in heuristics which must be assessed. We must also be able to 
differentiate uncertainties from mistakes. There is much basic data that needs to be 
obtained. The limited efforts that we have made in that direction have indicated 
clearly that the problem of getting and characterizing the basic uncertainties data Is 
just as Important as that of propagating it and using it in a decision-making 
environment. We also heard some statements such as "this is an NP complete problem 
and we cannot solve it" and questions concerning the lack of formal methodologies to 
deal with the combinatorial explosion. But what if we are looking for a solution, not 
the best solution, just a satisfying solution. The warnings about combinatorial 
explosion are still appropriate and need to be heeded. But, in many cases, these 
problems might be made easier If we do not absolutely Insist on getting the best 
solution and find an adequate solution instead. We also heard several papers which deal 
with qualitative physics. This has been very enigmatic to me. Sometimes when you try 
to form a hybrid of that type, (hard science ("physics"] tempered by heuristic or Integral 
approaches ("qualitative"]), you can escape with the worst of both worlds. Qualitative 
physics has a very noble goal, but it can also be fraught with difficulties. Then we 
heard some papers on learning by discovery or by analogy. There l worry a little bit 
about the applicability to real-time problems. I think that is really hard, and l am net 
really sure it is possible. 

Surprises . The first surprise is that, out of the whole conference, l saw one or at most 
two papers in the area of concurrent computation and parallel computing. That 
surprised me because this is an area that is absolutely fundamental to getting anything 
done. There may have been many more papers in sessions that l did not attend. At 
least in those sessions that I attended there was surprisingly Uttle discussion of 
implementation of parallel algorithms. In a similar vein, you might be aware that there 
is a renewed interest in neural networks. The UCLA paper is an example of that. But, 
given the amount of interest that there is currently in neural networks, the number of 
papers on that topic that were discussed up at this conference was unusually few. 

I have several remarks about the conference as a whole. First of all, I wanted to thank 
the organizers. I thought that it was a really interesting and well organized 
conference. You inevitably run into a difficulty when you do things too well. The large 
attendance forces you to go to parallel sessions. That is the normal problem. The 
accommodations, the people who were here, the subjects, the papers, were well worth 
coming. It was all very interesting as far as we were concerned. We would look 
forward to another meeting of this type. 

T. B. SHBRIDAN: Bach of us looks at the world with glasses colored in a different part 
of the spectrum. My part of the spectrum is supposed to be that of the man- machine 
interface, which I have been interested in for a long time. From that 
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point of view, let me categorize my comments Into several parts. One of the most 
human things Is that we use words. In certain emerging fields, words sometimes get a 
bit fuzzy and Imply things that we. if we are honest with ourselves, may not quite 
muster. But we persist in using those words anyway. In a conference like this, and 
really for a long time, I have been a bit concerned about working toward using the 
Bnglish language in such a way that we are not kidding ourselves and are meaning the 
same thing. A term like teleoperator, which has been around for a long time means to 
do something at a distance. It has come to mean doing something with more or less 
continuous human control at a distance, although I suppose you could have a computer 
teleoperating another computer. I think that term is more or less settled in terms of 
human control. 

The word telerobot means controlling from a distance something which at least has 
some autonomy. It means human control, in presumably supervisory fashion, of a robot 
which to some extent is autonomous. But, it seems to me here NASA or somebody 
might define what the term telerobot means. Let us not use it to mean all of 
teleoperatlon. 

We come to the word telepresence, and there is an open opportunity for chaos and 
confusion. Telepresence can mean two things. It can mean feeling like you are present 
somewhere else. You could also use the word telepresence to mean you are operating 
at a distance just as though, and just as well, as if you were there. But, you can be 
telepresent and still be doing supervisory telerobotlc control. Or, you can be 
telepresent controlling and not feeling the present. What I am saying is that there are 
various Interpretations of that term needing better definition. One can begin to name 
other terms. It may be time to draft a semi-official glossary defining such words. 1 
think that is important, because people confuse each other with words. 

Some things I would like to applaud. We have talked about end- effectors for a long 
time, but we are beginning to see a real experimental science emerge on the use of 
more complex end- effectors. We are beginning to understand what it means to have 
multiple degrees of freedom in an end- effector. The work of Ken Salisbury, Larry 
Leifer, and others I could name is beginning to give us some nice science. Some of 
David Aiken's work on the combined use of the body and the hands is providing data. 
We need data. We have not had much data in this area. We are beginning to get some 
microdata. Blake Hannaford, Larry Stark and others reported on microforce data. That 
is, within a task, there are different forces that do different things at different times. 
I really applaud getting instruments right into the active manipulation site and getting 
some good hard data. Also we are beginning to understand this whole problem of 
impedance control. Not just force control. Not just position control. It may be some 
kind of a hybrid that generalizes the whole story. These things all go together. 

As a next category, let us look at vision. Some very exciting things are coming along 
with head- mounted virtual displays. These help on the sensory side to achieve some 
telepresence. They also permit us to do other things that we have not been able to do 
before: wider fields of view, certain kinds of simulation experiences, me of computer 
graphics to help an operator see things that he or she normally could not see, 
superimposing video graphics on TV, rendering multiple views coming in part from a 
model, etc. These are all things that are relatively close in. 

We jump a little bit farther and get the computer operating at a more profound level. 
We are beginning to see some nice work done in areas of linguistic interfaces with 
semi-automatic telerobotlc systems. Here, we should not lose sight that we need both: 
what I would call symbolic language, which I am using now as I speak; and analogic 
language, which we me with steering wheels, joysticks, and our hands when we point, 
pull, push, etc. In our everyday life we me both of these languages when we 
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communicate with each other and with the physical world, I am sure we need to 
understand how to combine the use of both of those languages. 

Computers for a number of years now have helped us resolve coordinate systems. That 
kind of thing is very well in hand. They are going to begin helping coordinate two arms, 
helping us coordinate arms plus vehicles that have to be controlled at the same time, 
and indeed helping us coordinate any linkage or set of rigid interconnected bodies with 
more than six degrees of freedom. There are some tough problems still to be resolved 
there. 

Looking into the future a little bit more, we see computer aids for sensing and 
planning. Now that is a bigger package. But, l think that as long as we consider that 
these are really, in large measure, aids to help the human sense and plan, we come to 
realize from looking at human anatomy that there are many more nerve cells for 
sens'^g and planning than there are for motor control. Indeed, in telerobotic systems, 
that is going to be the same. We are going to have much more cost, expense and 
complexity in sensing, planning and decision than we are going to have in purely motor 
control 

Another topic which I believe has been neglected is finding good performance measures 
for teieoperators and telerobots. It is still very difficult to determine that any 3iven 
telerobot is better than another. Of course, you have to be able to say that it is better 
at what. Now, you could say that there is no way of answering that question unless you 
are talking about a very specific task. That may be true in an ultimate sense. I would 
assert that there is, in-between, a possibility involving a battery of tests, intelligence 
tests or motor- skill performance tests. These tests would consider a whole range of 
different kinds of tasks that you can do with hands, eyes and sensors, automatic control 
system, or planning, etc. These might be a better foundation for comparing the 
performance of different telerobotic systems. We do not have that yet. In fact, if you 
look at the literature, very little research has been done on it. Everybody has a 
different task board, in effect. 

Finally, I will touch on one of the problems that has been a concern to a lot of people. 
The way that AI researchers, and other researchers that work on heuristic programming 
and develop LISP programs, talk about models and control is somehow different than 
the way in which control engineers talk about modeling and control (which usually is in 
terms of differential equations and things like that). I think that it is time that we 
interconnected these two ways of thinking: the analog and the digital. There is a lot of 
hard work to be done there. 1 see that as a very long range problem. 

G. SARIDIS: I will be trying to express my feelings, feelings is the right word, about 
the area that I am interested and concerned with. This is the theory of inteilegent 
machines. In this meeting, as in several previous meetings that I have attended, the 
underlying idea is that of associating some kind of intelligence with the robots or 
machines. I think that there are three basic disciplines that contribute toward a theory 
of intelligent machines: artificial intelligence, operations research, and system theory. 
All of us probably agree that this is where we should be Looking into, to develop such a 
theory. I was asked to express the present situation in my area and to discuss where we 
stand regarding intelligent machines. Let me be a little more specific. My interests 
axe autonomous systems, in contrast to man- machine interactive systems. Autonomous 
systems, typically called robots, are systems that can perform autonomously without 
interaction with the human operator. 

In terms of actual applications, I can say only one thing: there are very few. There are 
very few places where one can find a machine that has some kind of intelligence. There 
is a lot of research going on. For instance, I just came back from a visit to JPL, and I 
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was really Impressed. I saw a “smart" hand, and I saw some other components of 
intelligent machines. For quite a while, JPL has been a pioneer in this area. In general, 
NASA has been interested because of the unique work in space that it is doing. This 
includes both manned and unmanned space exploration. I think we have been 
de- emphasizing unmanned space exploration. The funding seems to have decreased, 
although great work has been done. Instead it should have been increased. Industry is 
another area where these machines are applicable. We are talking about the factory of 
the future. It is going to be modularized. There will be no workers there doing things 
that the machine can do better. The human operators will be in a role involving 
primarily monitoring and supervision. There might be some maintenance crews. The 
humans will be doing the more intellectual work of designing and building those 
machines. Underwater and underground exploration are other very important 
applications. Automation of mining operations is another application area in which 
there is much Interest. Nuclear handling is another area. This has been explored for 
quite a while. There are, for example, Interesting applications for smart robots in 
safeguarding nuclear plants. The problem of safety could be improved. There is also 
some role for automation and robotics in medicine. Other organizations (e.g. 
government agencies) have also established efforts in automation. Substantial research 
is also being conducted in universities. Most of the major universities have a strong 
program in automation. The automotive industry has also been pioneering in research 
to create an automated environment in the factory of the future. I would like to make 
a parenthetical remark. An intelligent machine can be any machine that can perform 
tasks (including anthropomorphic tasks) without interaction with a human operator. 
Some of the machines used in the automotive industry are examples of 
non- anthropomorphic machines that can perform anthropomorphic tasks. Work in Japan 
has shown the first level of intelligent or semi- intelligent (i.e., smart) machines that 
will remove the worker from the factory floor. Now they are working on the second 
level which will demonstrate more intelligence. They have different stations and robots 
that can move from one station to another and be reprogrammed. 

I would like to now turn to the issue of how to build those machines. There exist, of 
course, various components. There is the motion hardware. There is sensing - different 
types of sensing such as vision, tactile and proximity sensing, etc. One issue is how to 
integrate them. How do you get the people involved in these areas to communicate 
with each other? How do you get what I call equalization of communications? How do 
you establish a common language so that there can be meaningful exchange among 
the various research groups? I am a strong believer in a mathematically structured 
intelligent machine. How is that going to be materialized? I proposed a probabilistic 
model. Other people have proposed fuzzy or possibilistic models. AI researchers want 
to have a purely heuristic model based on AI principles. Are we going to have a 
hierarchical structure or a distributed structure? What are we going to emphasize? 
Kinematics? Dynamics? Both? Is control going to be adaptive? Is trajectory planning 
going to be done using simple kinematics? How are the actuators going to be 
incorporated? How is the hardware going to be improved? If there is some kind of 
heirarchy, Jiow are the higher levels going to be constructed? Is there some additional 
intelligence at the higher levels? How are tasks going to be executed autonomously? Is 
the task to be decomposed or synthesized for comparison with the command? 

Finally, I will make some brief comments about the future. What is the final goal of an 
intelligent machine? How can we tell that the machines that we are building are really 
intelligent? How can we measure that? In response to this question, I will offer the 
following remark. We will have succeeded in building machines that are really 
intelligent when we can build robots that can by themselves build more robots. 

A. SANDBRSON: I will comment on the topics in artificial intelligence that have been 
discussed at the meeting. Those topics have focused on issues of task planning and 
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trajectory planning! as well as navigation of mobile robots. This is a subset of the 
broader spectrum of Issues In artificial Intelligence, but It seems to be a subset that Is 
particularly relevant to the kind of tasks that NASA hat defined. In this area, there is 
no shortage of buzzwords and Jargon. For example, the discussions reflected the Issues 
of: what la scheduling vs real-time planning vs what is off-line planning; what Is 
reactive planning. There It a whole set of terminology which reflects, In a sense, the 
rather nebulous view not only of the solutions but of the problems themselves. I think It 
also reflects the clear view that these kinds of techniques are going to be the keys to 
the evolution of the technology, but In fact there are basic concepts and basic Issues 
that need to be worked out. It Is not a case of taking existing tools and applying them 
appropriately. It Is the case of working out some quite basic principles and 
understanding how in the case of particular domains and applications they can be made 
useful. 

This background slide suggests a task domain where Issues of task and trajectory 
planning are relevant to space applications. 

BACKGROUND 

Space-based Diagnosis, Repair, and Assembly Tasks: 

• Materials Handling 

• Fault Diagnosis 

• Reasoning about the Origin of Paults 

• Hypothesis Formation and Testing 

• Planning and Execution of Repairs 

• Disassembly and Assembly 

• Replacement of Parts 

This slide suggests seme topics or tasks in diagnosis, repair and assembly of systems. 
The kind of issue which arises here is the mixture of complex tasks that need to be 
accomplished in a relatively isolated environment. There are problems of materials 
handling, reasoning about faults and diagnosis of faults, forming hypotheses and 
developing strategies to test systems, planning and execution of repairs of those 
systems (which in Itself involves assembly and disassembly of parts and replacement of 
those parts). These are all subtasks which go along with the Idea of having systems in 
isolated environments, and other systems with sufficient capability to keep those 
systems repaired, maintained and serviced. 

You can look at this list and ask what is different about doing this in a space 
environment, as opposed to the problem of repairing your computer when it breaks 
down. I think there are several dimensions to that. One is that the physical 
environment in which you need to carry out these operations is quite different. The 
vacuum, zero-gravity, lighting conditions, the physics of sensing and manipulation are 
quite different. So you need to be able to do the manipulation and do the tasks in a 
different environment than might be done in other situations. Secondly, the situation 
itself tends to be more isolated and less interactive than in other cases. It really forces 
you into the issue of asking what it takes to do things in an autonomous or 
semi- autonomous way, where you do not have the fall- back position of someone walking 
in and Interacting with the system. So it really forces the issue of how to accomplish 
useful work and tasks in an autonomous way. The third aspect is the nature of the 
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syster.4 themselves. That Is, you ere not typically dealing with a line of computers or 
television sets that you see the same things with the same faults every day. There are 
many unique systems which have to be addressed, unique designs, unique requirements! 
and you have to be prepared to cope with those unique requirements. So that set of 
Issues really defines these tasks as quite different from any other related tasks. The 
collection of those poses a considerable burden on how you make systems, and 
autonomous systems, to be able to cope with these issues. I could make a similar list 
for another area such as navigation, such as exploration, and with similar kinds of 
constraints that are Imposed that make the task in fact quite difficult and challenging. 
The technical challenge is, I think, apparent. It is a combination of reasoning and 
manipulation, and how to combine those together. 

I will use one other slide which summarises some of the research issues which are posed 
by this kind of task. 

RBSBASCH ISSUBS 

REPRESENTATION 

• Geometrical, Physical, Functional 

• Uncertainty 

• Dynamics 
PLANNING 

• Diagnosis and Repair 

• Sequence Level 

• Discrete Task Level 

• Motion Level 

CONTROL: MANIPULATION AND SENSING 

• Adaptive Sensor- based Systems 

• Learning Systems 

• Multisensors 

This summarizes many of the issues which came up during technical sessions. I will not 
try to summarize all those in detail. Let me just make a few points. I think that there 
is a clear relationship between representation and planning and control. As someone 
suggested, planning and control are not as different as many people sometimes pretend. 
If you think about what even classical control attempts to do, it really largely has to do 
with resolving uncertainties. It has to do with being able to predict what those 
uncertainties are going to be, in a way that you can take timely actions to correct 
them. Th.\t has to do with the fact that, even in relatively simple control systems, you 
do not have a perfect model of the world, and you cannot devise your actions totally in 
advance in order to accomplish a task. So, you have to work with uncertainties, and you 
have to be able to develop predictions of the world in order to accomplish the task. 
This has a lot to do with planning, and planning, as it is used typically in AI, is a kind of 
reasoning in a symbolic world. It is a representation of the world which tends to be 
symbolic and relational. The kind of issues of representation includes geometry, physics, 
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functions of systems, temporal aspects, etc. Representation of uncertainty la 
fundamental to that, In the same way that It Is In control. It Is uncertainty, again, not 
In the sense that there are things that you simply do not know. But, the more that you 
can explicitly describe the nature of that uncertainty, the more knowledge that you 
have to Incorporate Into the system. The dynamics are Increasingly Important as you 
look at the kinds of systems which are being discussed here. So, as you look at the 
strategies toward planning, you typically see a hierarchical approach. You typically see 
various levels of abstraction defined. In the case of manipulation, you can define 
requences of tasks. You can define sets of discrete operations. You can define explicit 
motions. In the sense of trajectory planning. It is easy In a sense to link those together 
In a block diagram. It is much more difficult to be able to make systems that work and 
accomplish those tasks. One of the Issues, if you look at the work on planning, is that 
you do not see at this point complete working systems. You see pieces of systems and 
concepts toward working systems. At the lower levels of control, manipulation and 
sensing, you see a trend toward more adaptive and learning systems with the ability to 
Integrate information and resolve uncertainties among multiple sensors. 

I have a few notes on specific topics, which I think are worth mentioning. One is the 
problem of validation and evaluation of systems. How do you know when a plan works? 
When it is working, how do you know when to go back and replan? Again, in a dynamic 
and changing environment, that is important. The question of unexpected events and 
error recovery is implicit in this. Someone pointed out that, in many robot programs, 
90% of the program code deals with the unexpected events. The question of why is 
planning difficult, again, we could go on and on about. One of the basic issues is the 
computational and combinatorial barriers that you reach, given the complexities of the 
task. Trying to evaluate all the possible alternatives simply yields a combinatorial 
explosion, tn practice, the solution to that is to look for efficient representations, and 
often domain- specific representation* and control mechanisms. 

I will finish with a couple of remarks. One is to reiterate that the payoff I see from 
this technology is really in the evolution of systems. That is, it is important to have the 
core technology which becomes embedded in the telerobot systems and evolves into 
autonomous systems. The final payoff as we expect is perhaps with autonomous 
systems, but issues such as representation, such as system architecture, we cannot 
simply wait and implement in autonomous systems. I think the concepts have to be 
embedded earlier in the evolution of the systems. The final remark is a rather 
philosophical view of robotics. One can think of robotics as a collection of devices 
(robots, hands, sensors, etc.), or one can think of it as a science which deals with 
principles. In fact, it is a mixture of those things, tn many ways, I think of it as an 
experimental science, one in which you have to build systems and try to find out why 
they work and why they do not work in order to make progress. I think in this area in 
particular you have to build systems and use planners with real robots in order to make 
progress and come up with useful results. 

R. BAJCSY: I will concentrate on stating my views on future research directions in 
sensing and perception. Here is the world which we live in. Here are the sensors, i.e. , 
hands, ears, eyes, etc. Here is the representation. To set the stage, I assume that I am 
interested only in the world of physics and geometry. I assume, for the purpose of this 
discussion, that we have only contact and non- contact sensors. This is not a severe 
limitation, since in fact there are no other types of sensors. I would like to impress 
upon you that sensors sense only partially through a window of the world and only some 
aspect of the world. So one sensor can never sense the full world. So, having set the 
stage, I proceed to the mission impossible. The job is to recover the world from the 
partial measurements. Here are my questions for the next two or three years. Then, I 
have another set of questions for the following two or three years. Then, I have another 
slide for the next ten years. Question number one, which is dear to my heart, what is 
the information that you lose through these transducers? From which follows, models 
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of hardware and/or software. In my view, software Is e<[Utvalent to hardware, l.e., an 
edge detector could be Implemented either in hardware or in software. These models 
represent the processes that have to take place as you go from the world representation 
to the sensors and to the sensor data. So, one issue Is what is the Information that is 
lost, and hence the modeling question. The second question is what are the rules, or 
principles, of recovery. In the first question, things are being taken apart, whereas in 
the second, things are being put together. The third question, which is very clear in 
tactile sensing and information processing, is that of data acquisition. This is an 
essential part of the recognition, or manipulation, or whatever use is made of the 
information. This issue, in my view, is )ust as important in vision, although it may not 
be as readily apparent. Hence, the control Issues are a critical part of the requirements 
for machine sensing. We must get away from an approach to machine sensing in which 
there are no global goals that involve how the Information is to be used for the purpose 
of control (as an example). This is my view of the present state and of the Important 
immediate Issues. 

I would like to share with you what 1 think is a plan for the next ten years. I really view 
space exploration snd space robotics as representing a great opportunity as a research 
laboratory, or as a research environment, for discovering rules of evolution. Why7 Well, 
space represents different physics and physical laws. Different radiation. Our sensors 
have been developed or have evolved to be sensitive to a certain spectral range. In 
space, different sensors will be needed perhaps. But, of more interest to me, is the 
different representations that will evolve from the different sensors and physical laws. 
Different rules for update of this knowledge will also evolve, as the sensors take 
measurements and interact with the world. Hence, the rules of evolution. 

J. Y. S. LUH: Being the last person in the panel has advantages (I will not say 

disadvantages). All of the important and interesting issues are well taken care of. That 
means that I do not have to say anything. On the other hand, I have noticed a few 
minor things which have not been mentioned by the previous panelists. These are 
down-to-earth, for instance, kinematics, dynamics and control. These are basic issues. 
Without kinematics, dynamics and control, there would be no robots, either in space or 
in industry. Of course, there has been a lot of research on these topics for many years. 
Nonetheless, there are still open Issues that need to be studied and investigated in order 
to improve space operations. 

Por tnstance, one issue is that of robots with closed- kinematic- chains. These robots, it 
seems to me, would be very useful. This topic was studied many years ago in a very 
general sense. But, very few results were obtained that were specific to a robot and 
that were relevant to space applications. In order to investigate the closed- chain 
robot, extensive computations (for instance, to do inverse kinematics) are required 
quite often. To shorten the computations, parallel computation is one of the topics that 
needs to be looked into. Redundant robots is another topic which I think is important. I 
only judge from my experience. Everybody knows that the human body has redundant 
kinematic chains. Without the redundancy, you could not do a lot of things that you 
normally like to do, such as scratching your back when it itches. If we look at 
dynamics, we need to do a lot of work on analysis of flexible joints and flexible links. 
So far, the most complicated flexible robot that, to my knowledge, has been analyzed 
consists of two links. This is not really enough. There is a need to pursue that kind of 
research. 

If we look at control, we me a lot of words and names. For instance, one of the oldest 
control schemes in robotics is that of computed torque. That was done ten or fifteen 
years ago. Then, later we developed resolved rate, resolved acceleration, nonlinear 
transformations, adaptive, force control, impedance control, etc. But, there is a result 
in a paper presented here by K. Kreutz of JPL in which he found a lot of equivalence 
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unions the various concepts. Bp changing the names and equations, a lot of control 
schemes which seem to be different are in fact equivalent. 

But, whether the control schemes are equal or not, the most analytical results are very 
seldom applied or implemented in hardware. As in several of the earlier comments 
made by the panel members, robotics is an experimental science. It is nice to do 
analysis to guide your experiments. But the final )udgement really comes through 
experimentation. All the control schemes may look beautiful in analysis (and I expect 
to get a lot of comments from the audience on this), but we also need the 
implementation and experimentation to prove feasibility and performance based on our 
current hardware technology. 

In space applications, I listened to a paper presented on a virtual robot. I think this 
type of work is particularly relevant to space applications. Similar or equivalent work 
should be promoted. 

Now I move to teleoperation and telerobotics. I agree with comments made earlier 
about terminology. Telerobot implies remote control of a robot. Teleoperatlon can 
involve two robots or even multiple robots. There are a lot of papers on that topic. 
There are two items which are basic Issues and which have not been mentioned. It is 
my opinion that the main issue in the coordination of two or more robots is how do you 
divide the work. This is the task decomposition which is the main issue before we even 
start to do anything else. There is a Chinese story the: j pertinent here. There is a 
monk in the monastery. If he wants to drink water, he carries the water from the well 
to the monastery in two buckets of water, one bucket at each end of a beam, and the 
beam is balanced on top of his shoulder. If there are two monks in the monastery, then 
the same beam can be used. Bach of the monks supports one end of the beam. Now, 
however, there is only one bucket at the mid- point of the beam. If you have three 
monks living in the monastery, there will be no water. The point of the story is that in 
the monastery they must decompose the task to determine who Is doing what. 
Otherwise, the three monks (or robots) will Just sit there and argue who is doing what. 
A second topic which is also basic, in my view, and which has not been mentioned is 
that of limb coordination. We are talking now about two robots and about perhaps 
extending this to walking robots. There is a lot of Interest in walking robots, instead of 
wheeled robots. Multiple arm robots with dexterous end- effectors are also being 
discussed for space applications. I beiieve that the coordination of arms with the legs 
and end- effectors is also a very important issue. 

G. A. BBKBY: l would like to thank the panelists for their contributions. We have 
about 45 minutes now for comments, rebuttals, questions, suggestions, etc. If anyone 
would like to make a comment, l would like to suggest that he or she step to the 
microphone. 

A. J. BBJCZY, JPL: l would like to make a comment on intelligent machines and their 
relation to man. How do we communicate with intelligent machines? I assume that 
intelligence is also measured not only by the action that a machine can perform, but by 
the way that a machine can communicate with other machines, or in our case with an 
operator or with someone that is supposed to use it. In a telerobot sense, an intelligent 
machine is for a user. I would like to ask a sharp question on the issues of how to 
communicate with intelligent machines and how is Intelligence defined from that point 
of view. 

G. SARIDIS: This was a loaded question. First, let me start with the intelligent 
machine itself. I do not think that we have built an intelligent machine. Therefore, we 
have not explored all the capabilities of our machines regarding intelligence. But, 
suppose for the moment that we could do that. A good idea is to use the same approach 
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that we ere using in desling with expert systems. With expert systems, we have some 
intelligence stored in the computer. The operator can. by means of if-then statements, 
communicate with that particular computer. This assumes that the computer in which 
the expert system resides has what I would call static intelligence. An intelligent 
machine would have what I would call dynamic Intelligence. I think that it would be an 
improvement on an expert system to do that }ob. 

8. BAJCSY: Communication with an intelligent machine depends on the representation 
that the particular machine has. It was precisely this point that I was trying to promote 
in my presentation. The world is out there. We have these limited sensors which 
reduce the data and store it in memory in some organized fashion. This we call 
representation. There are also additional rules and laws that control both how to gather 
the data and how to store the data. In space, we have this wonderful opportunity 
because we do not have a preconceived a priori representation of that world. 
Therefore, we have an opportunity to gather this knowledge iteratively and to learn 
something about learning itself and also about evolution. 

A. LOKSHDf. JPL: 1 would like to share my views on the problem of robot error 
recovery after what are referred to typically as unexpected events. I believe that they 
are quite close to the views that the panel expressed, but l would like to introduce 
additional ideas. 1 believe that the only unexpected event that could be properly dealt 
with is an event that is expected well in advance. If you look for example at our 
society, people (such as fire- fighters, commanders, etc.) that are supposed to deal with 
real unexpected events, have very rigorous training to be able to foresee any kind of 
possibility in advance. If a person behaves properly in real unexpected events, he 
usually gets a medal. I do not think that the PUMA 560 deserves a medal every lime 
that it does a simple lask such as acquiring and grasping a tool. Therefore, a more 
precise term than unexpected events is that of abnormal events. If we use this term, 
then work that focuses on case studies in a particular task domain will be encouraged 
and more readily accepted. Such case studies that address the issue of dealing with 
abnormal events in particular task domains are needed in robotics. For example, a 
robot could try a thousand times to unscrew a bolt and then tell us all of the problems 
that it encountered in all of those attempts. This experimental approach would be very 
similar to the one used in the very early stages of the development of coding theory. 
There, a lot of very simple statistical experiments (such as counting the number of 
times a particular letter of the alphabet was used in average speech) were conducted. I 
think this type of work and presentations should be encouraged in robotics also. 

A. SANDB8SON: I agree. There are levels of abstraction in terms of thinking about 
what is expected and about what is normal or abnormal. The key question is what is the 
capacity of the system to interpret any kind of event and how does it decide what to do 
about them, (n some sen c what you are referring to with the PUMA is that, when you 
program it to read a monitor that is tripped, and you have a fixed subroutine that it 
goes into, you can argue that it is error recovery, but it is not intelligent error 
recovery. The more general case that you are referring to is where you have a deeper 
representation of the system and where, instead of having a kind of subroutine, you 
have some capacity to reason about the nature of the event and its origin and to think 
in a deeper sense about the appropriate reaction, given all the constraints, the stales of 
the system, and the goals. 

C. WEISBIN: I think that it would be rather bold to suggest that we could deal with 

unknown unknowns, which is what I think is at issue. When you try to put a framework 
in place to deal with classes of potential events, as opposed to specific recovery modes, 
you need a framework for broad on-line dynamic replanning, which is more than simple 
error recovery. 
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I fully Agree with your question. If I did not ever entlc Ipete something to happen, It 
would be unlikely that our robot would cope with it properly. On the other hand, I do 
believe that you can anticipate broad classes of events that you can handle on-line 
through a replanner, without having prescribed in detail the particular event and the 
particular reaction to the event. No nutter what we call that, whether it is error 
recovery, unanticipated events, robust planning, etc., the ability to perform this 
function is essential. 

G. A. BBKBY: Let me add another comment to that from my personal experience. 
Many years ago I worked on issues of trying to model human performance in particular 
kinds of systems. Tom Sheridan has done similar work. One time, we were Interested 
in modeling the way in which human pilots adapted to particular catastrophic failures in 
aircraft, under situations where recovery was possible. There are obviously times, such 
as the extreme case in which the vertical stabilizer falls off, when you cannot recover. 
There are situations where the stability augmentation system fails, and recovery is 
possible, but the control strategy that the pilot has to follow is different. The issue 
here is very similar to that raised by the question. Is this an unknown unknown, or is it 
an unknown for which previous learning took place and there are strategies of recovery 
stored somewhere in the system7 What we did was to try to model the recovery 
strategy, and it was obviously highly dependent on previous training. This raises very 
complex issues in AI and in the way in which knowledge is represented. It is a very 
interesting question with a lot of implications in many of the areas that we are dealing 
with. 

R. SCOTTI, ORI: I would like to raise the issue of what is the ultimate model; what 
sort of ultimate model are we thinking about, when we think about robotics and 
artificial intelligence. The question came up in one of the sessions, where we discussed 
the prospect that perhaps it is an anthropological model that we ar? talking about if we 
follow the idea that man is the model. It is an issue, what is the model. As in our 
experiences with science in the past, solving a problem really revolves about 
understanding and having a clear idea about what the problem is. History has shown us 
so many times that once we get to that stage, then the answer just comes out. The 
issue of what is the model is really important, and I would like to hear something on 
that. 

Aijgth-r issue that has come up is that in many other fields of human activity, 
specifically sports, there is an entirely different understanding about the relationship of 
thinking, acting and accomplishing. This is the concept of visualization. I have seen 
some really wonderful demonstrations of optical simulation which has been proposed as 
a method for doing things where you cannot sec. But, the whole issue keeps coming 
back to me of how we can investigate the relationship between what a person conceives 
and visualizes and what that person can actually accomplish. Or, turn it around in the 
other direction. I think that we have not yet begun to scratch the surface of this very 
interesting area. 

A third area, which may be a bit sensitive, but which l feel strong enough and bold 
enough to mention: what is our responsibility as sincere dedicated people to unfolding 
these concepts and putting them before humanity in our evolution? What is our 
responsibility to the morality and to the integrity of these things to which we are 
dedicating our lives' energy. Specifically, what is our responsibility to the future 
environment in which we have to work. It is definitely an economic, financial, 
environment. Space station for example is going to require funds and guidance coming 
down from the top. We are all of the age where we have seen these projects come off 
from a very sound position of integrity and morality. Then, for some other reasons, the 
scientific results are used to address other issues. 
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O. A. BBKBY: Let us respond to the three Issues one et e time. Does anyone went to 
comment on whet the right model (or robots in AI Is? be human the right model (or e 
robot? 

T. B. SHBBIDAN: We probably ell have something to say about that. Bach o( us cannot 
get out o( his own skin. So, to some extent, our model o( the world starts out being 
ourselves. There b no other way it can be, (or starters. The better we define and 
understand a problem, the less the thing we build to solve the problem looks like 
ourselves. This has been generally true In human technology as it has evolved over the 
centuries. It seems to me that we always start out anthropomotphically. But the 
better we understand it, the less It looks like a human because we refine the technology 
to serve that particubr need. At some point, it may stop looking Uke a human. In so 
far as it b so robust that it handles things that we cannot anticipate as human beings, 
we get back to our unexpected event, and there b no solution. My feeling about 
unexpected events, to come back to that for a moment, is that there is an expectation 
of aero on the density function. It involves infinite information, and there Is zero 
apprehension on the part of the human. We are always in this sort of scale, starting 
from ourselves and what we totally expect as humans, and sliding toward specific 
technology for specific tasks as we understand them. 

C. WBISBDf: I would take the position, as an engineer, that we are trying to get a Job 
done. Alternatively, one could take the classical AI approach that aims to study how 
humans reason and think. I do not think that there is a right or a wrong answer. 
However, l would clearly come down on the side of getting the Job done. I am not sure 
if you were asking whether the physical object that gets the Job done should have 
anthropomorphic characteristics, or that the cognitive aspects should have 
anthropomorphic characterbtlcs, or both. I think that we would be doing ourselves a 
disservice if we insisted that robots have only two arms, that they be able to lift only a 
certain amount weight, etc. - in order Just to follow the human model. One could argue 
that, since most of our current plants are built for humans, we ought to build robots 
that look like humans because they would fit into current plants. This might be 
short-sighted. 1 would therefore come out axainst the view of physical 
anthropomorphism and the view of exclusively confining ourselves to paradigms of 
human reasoning. Where you can solve differential equations and accurately get the 
right answer, this should be the approach. To take the view that we should reason with 
heuristics because people do, is not always the best approach, in terms of getting the 
Job done. If you are trying to study human beings, then the context is different and it 
would make sense. 

G. SABIDIS: I would Uke to bring up an example that I encountered when I was in West 
Germany last summer. There was a project about plugging some holes on the body of a 
Mercedes-Benz with rubber plugs. Humans used to do that, but it was to cumbersome 
and unhealthy. So, they decided to try to replace the human. They did a study to 
understand if a vision system could be used to center and align the arm to the hole and 
if an arm control system could then be used to push the arm into the hole. They were 
very unsuccessful. They replaced that with a magnetic scanner, and they were able to 
do the task with only one attempt. This proves the point that C. Weisbin made. 

R. BAJCSY: If I assume that in your question you are equating model with 

representation, then I wish to make a very strong statement. There is not one unique 
representation. There are multiple representations for anything that you wish to do. A 
second remark is that: I do not think that Homo sapiens should be so pretentious to 
think that they are always the best models for a given Job. I think models should be 
task- driven. 
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G. A. BEKBY: There t* a cartoon that many of you may have seen. It shows a human 
being whose function It was to push the emergency button in case of a problem, and he 
was replaced with an anthropomorphic looking robot with its finger ready to push the 
same button. This is clearly not the right kind of a model. 

The second question had to do with the relationship between thinking and 
accomplishment. Since human beings can improve their tennis by ptaying inner tennis in 
the sense of using visualization to Improve their performance), can robotics be 
improved by using a similar approach? 

C. WBISBDf: Speaking as a rather poor athlete, who is uniformly poor in a variety of 
sports, I believe that athletes improve performance through rote practice without 
necessarily visualizing each detailed step. When you throw a bowling ball 27 times, or 
2000 times, after a while it becomes a routine. If you are trying to get a kill shot in 
racket-ball, you do not necessarily do that very well through visualization, you do it 
pretty much by rote after significant amounts of practice. I make no scientific claim 
that this is true. It would be quite reasonable to take some intelligent machine for 
example, and have it practice in a variety of related configurations and try to improve 
skills through perturbation and credit assignment reward, rather than differential model 
building. That would get us into the area of learning. I think the athlete analogy is 
more one where humans improve pretty much through repetition. 

A. SANDBRSON: Let me propose one possible scenario. You could think of a 

simulation as a kind of visualization in which the robot does not actually execute the 
command physically but could execute parts of it in conjunction with simulating other 
parts. In fact, it might be able to practice and learn without actually doing the 
execution itself. Some of the psychology that goes with the visualization approach says 
that humans may be able to do that. 

G. A. BBKBY: Is any one willing to deal with the third and more difficult question, the 
one that deals with responsibility? 

T. B. SHBRIDAN: First, I wanted to commend you for asking such a question, because 
it is terribly important. It is the kind of question that is so embarrassing to talk about, 
that we do not talk about it at scientific meetings like this. I think however that it is 
ultimately very important in AI and robotics. One of the reasons that we behave 
ourselves in the world is that, when we start encroaching on the rights of other people, 
we put our own bodies, in effect (personal reputations, health, etc.), on the line. Once 
we are able, in a telerobotic sense, to guide or to initiate the actions of 
semi- autonomous devices that can go out and creep around the world, and make trouble 
in the backyards of others, it will be very easy to abandon responsibility and to claim 
that the driver is not us. In fact, responsibility vill be increasingly more difficult to 
trace back. We are getting into a territory here which is fraught with social problems. 
It is also my belief that SDI is the tip of that iceberg. I hope sincerely that NASA does 
not get in that business. 

G. A. BBKBY: l think it is also important to know, however, that much research in 
robotics, as G. Saridis pointed out earlier, is concerned precisely with removing humans 
from some of those jobs which are the most hazardous to them, whether it is in mines, 
or in the presence of hazardous chemicals, etc. I think that clearly is an issue in which 
robotics is contributing to human welfare. 

G. SARIDIS: I think there are two questions involved in that. One is our responsibility 
in creating machines that might be used by others in detriment to humanity. Second, 
suppose that we create intelligent machines, and these machines get out of control. 
Will these machines have the built-in potential to create some kind of 
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chaos? Not because the hunuaa ordered them to, but because of themselves. ThU type 
of hell it reminiscent of the movie 2001 Odyssey. We should keep this in mind ss we 
aim toward intelligent machines. I think however, that education can solve the 
problem. It is • matter of developing social responsibility through education. We have, 
in the past, been very successful in creating instruments that can be used either to help 
humanity or to destroy humanity. Take the simplest of things, as an example. Take a 
glass, a small glass, that you can use to drink water. I can break that and cut 
somebody's throat. I can use that as a weapon. Unless I control myself, and I have to 
be educated to do that, I can use even the simple things as weapons. The second 
problem is much more serious, and I do not have an answer for it. But then,... 1 do not 
know either if we are going to ever create intelligent machines. 

L. MARTIN, Telerobotics International: I would like to state an alternative view on 
anthropomorphic kinematics. The thoughts behind anthropomorphic kinematics is that 
we are trying to replace a man in an BVA atmosphere trying to do a variety of tasks. No 
doubt there are machines that can do specific tasks (such as punching buttons, plugging 
holes, etc..) in a much more simplistic manner. To be able to replace a man in that 
environment, with the design standards that are already in place for NASA to be used in 
satellites that will be put in place decades from now, we must look at machines that 
replicate the kinematic characteristics of the human (in size, lift capacity, etc.). I 
would like to make one observation in general. I think there is not a whole lot of 
hardware work being done at the present time. I would like to take you back about 
eighty year.: by analogy. I would challenge this group, that if eighty years ago you had 
the challenge that the Wright brothers had, you would be designing the automatic pilot 
before designing the airplane. I would suggest the more pragmatic approach. 

G. A. BBKBY: Thank you for your comment. That will not go unanswered here. 

G. SARIDIS: Well, I am for parallel processing, instead of serial processing. The 

question was whether we were going to do that serially or in parallel. We do both 
research and experimentation, in parallel. If we were smart enough ten years ago, when 
the Japanese announced their automation program, and we were doing basic research as 
they did, our industries would not be in the bind that they are in now. We are trying to 
catch-up and establish a more competitive role with their industry. 

R. BAJCSY: I am sorry you got this impression. I am in academia and we do very hard 
hands-on experimentation. So do most of my colleagues in robotics. While we are 
doing hands-on experimentation, we have to look ahead. We have to look for generic 
problems, for problems that can be generalized. We have to look for principles. We 
have to make contributions to the general knowledge in robotics. While we are doing 
that, we are also testing on concrete examples. We are not avoiding doing hands-on 
experiments. 

J. Y. S. LUH: That is a very good answer. I just want to add specific comments about 
the issue of kinematics. The problems of kinematics is well studied, and there are many 
results. The important topic currently is that of redundancy. The redundant kinematic 
robot has been studied by many researchers. There are to my knowledge anywhere 
between twenty to thirty papers, several of them even in journals. However, more 
experimentation is required. 

G. A. BBKBY: I can also make one observation of my own. My colleague, Professor R. 
Tomovic of Yugoslavia, and l are in he process of not only studying but building a 
five-finger anthropomorphic hand for robotic applications. With any luck at all, you 
will see it at the forthcoming IEEE Robotics Conference in Raleigh. But, at the same 
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time, I am very supportive of the efforts of my colleague Professor Lee, (who is sitting 
out here) who has designed a hand with two thumbs. Some of us may be all thumbs, but 
his is meant to be an alternative robotic hand which is not at all anthropomorphic. So, I 
think we need to keep the principles that various people have brought here all in mind. 
I think you would be surprised, back to the questioner, how much hardware work goes 
on. And, if we had the resources to do more prototyping and experimentation, we 
would do a lot more. Developing prototype hardware is very expensive, and most of us 
simply do not have the resources. But this is not from a lack of desire to do so. 

D. PIVIROTTO, JPL: I wanted to touch on the point on why there were so few papers 
on parallel processing. One of the things that I have been doing is to search for 
someone who is actually putting together flight computer systems that we might be 
able to use for space robots. There is no one, as far as I can determine. Do you have 
any knowledge about research in computer systems for robotic applications in space? 
As probably everybody knows, space computers have to be reliable, fault-tolerant, 
radiation protected, etc. You cannot just take a PC and fly it. I also notice that there 
does not seem to be a whole lot of work in computational systems for practical 
telerobotics applications in space. There was a paper by R. Travasos, at a recent 
robotics conference, who tried to address the problem of Unking up different kinds of 
computers to solve the overall telerobotics problem. I have not seen very much of that 
going on. This work is interdisciplinary. I was wondering whether the panel could shed 
any light on this. Are there research programs of that nature? 

A. SANDERSON: I cannot say that I know of a real effort at space- hardened multi or 
paraUel processing. The efforts at the research level tend to be at architectures for 
appUcations, and there are selected prototypes that have been built. At CMU, there is 
a WARP processor, a systolic array machine, which Is being used on a mobile vehicle. It 
resides in the mobile vehicle. It is being used for low-level image processing. It is 
being used as part of the autonomous system. I do not know of an example which 
includes aU of the aspects needed for space- fUght applications. 

G. A. BBKBY : It is very simple. Just get a space-hardened connection machine. 

D. PIVIROTTO: ActuaUy, the DARPA machines appear to be based on the foUowing 
approach. People have a theory of what makes a good AI processor. Somebody else 
then has a theory about how to link machines together to do processing. ActuaUy, none 
appear to meet the anticipated performance. I have heard papers on several machines. 
None of them are designed from the point of view of defining a system that we are 
trying to make work, and putting together a complement of computers to do the job. I 
think that people get seduced into thinking that, if we have the connection machine, aU 
of our problems wiU be solved. From the papers l have seen, that does not sound right. 

C. WEISBIN: What I had in mind when I brought up that question was that, if you 
attended many of these sessions, you would see a myriad of boxes: vision, planning, 
control, etc.. Many of the boxes would be running in paraUel. The implication is that 
there is research on-going in which such an implementation is being considered. 
Without beginning to consider space- hardening or anything like that, our limited 
experience with a very narrow class of machines indicates to me that there is 
significant potential out there in hardware. However, there is a major problem in 
software. If you had ten, a hundred, a thousand processors, the scheduling problem of 
how to allocate tasks to processors, and to determine how they would communicate, is a 
very legitimate research area. My comments were that I expected to see more papers 
here in this area. Regarding your question about whether anyone is to the stage of 
fielding something (and I am not even touching space now), there is one military project 
that I am aware of where that is being considered. Clearly, the conditions that you 
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would have in space would be quite different. In any event, I think it is premature. I do 
not think that we currently know how to effectively schedule and determine the 
communication modes in general for an arbitrary number of processors. 

J. C. HORVATH, JPL: I am from the Hypercube group at JPL. I would like to respond 
collectively to all of this. There has been a protect underway for quite a few years at 
J PL and Caltech to do parallel processing. We have built hardware which is meant and 
designed specifically to be space- qualifiable, eventually. Sometimes we have been 
criticized for being too simplistic Just because we are trying to stay simple, so that we 
can deal with real-world problems. We have a substantive effort ongoing in software 
and have solved quite a few real problems. I am here at this conference mostly to raise 
interest within the robotics community in using the Hypercube. So, we are ready if you 
are (Laughter). 

C. WBISBIN: I would like to ask a question of the speaker. I applaud the effort in using 
Hypercubes. It is fully consistent with what we are doing. I just want to know whether 
you disagree with the statement I made that, as far as I knew, task allocation and 
scheduling of a variety of tasks (planning, manipulation, control, sensing, etc.), with 
real-time feedback, is something that is still in the research stage and not ready to go. 
Would you disagree? 

J. HORVATH: I would hesitate in this forum to say yes or no. I know that there are 
other members of our group that want to do it, but I do not really know how far along 
they are. I would have to refer you to them. 

A. GOFORTH, NASA ARC: There is a program at Ames that is developing a 
space-home symbolic processor. You might see a paper in a year or so about its 
applicability to telerobotics. We considered it for the strawnun design of the Flight 
Telerobotic Servicer. Yes, there is work on processors targetted for space. It is rather 
premature to present papers, when the details of the hardware have not yet been 
worked out. The big problem for a space- qualified system is the power consumption, (f 
you want to use a parallel processor, the real problem is the bus or the 
interconnections. That uses a lot of power. So, a lot of power is required to support 
parallelism. I am not saying that it cannot be done, but that is the real tall pole. With 
regard to the issue of tasking software for parallel operations, it is true that no one 
knows how to .0 that. But, for pure master- slave teleoperations, that is easy. It may 
be as easy as taking what is being done with VAX machines in laboratories and moving 
it into space- qualified equipment. 

G. A. BBKBY: Let me suggest that it may be a good time to stop and perhaps continue 
on an Individual basis. Let me summarize in a few sentences what I think has happened 
here. This has been an extremely enlightening discussion, at least for me. It is 
interesting to see how some of the issues came up again and again in the conversations. 

• Many of the issues involve how we model the world. How do we extract 
information from it? Do we do it by using sensors? What kind of abstract 
models do we use for representation? Then, there is the issue of how 
knowledge is represented, transmitted and used for the design of systems. 

• There is a question of where the intelligence resides, and the relation 
between building intelligent machines and, conversely, using machines which 
in some sense imitate intelligent behavior. Those two things may or may not 
be equivalent. 
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• The question of uncertainty came up again and again - ways of modeling it, 
of detecting it, and Incorporating it in systems. 

• The question of evaluation and validation of systems, particularly 
man- machine systems, came up several times. It is clearly a very Important 
one, if we are going to be able to assess our successes or our failures. 

• The above issues of modeling and representation, intelligence, uncertainty, 
evaluation, and validation are all very closely related to those of planning and 
scheduling. 

• I appreciate the comment by Dr. Sanderson that robotics is an experimental 
science, and that there Is a great deal to learn by experimentation. 

• Finally, we should not forget that we do in fact have a responsibility to 
humanity and to ourselves. 

One of the panelists said that one of the important areas is to Improve communication 
among researchers. This conference has contributed admirably to that goal, t would 
like to thank Dr. Rodriguez for organizing it and bringing us all together. 
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Technical Sessions , Thursday Afternoon , January 22 

1:30 pm — 3:30 pm Panel Discussion — Little Theatre 

Future Research Directions 

Moderator: G. A. Bekey, University of Southern California 

The intent of this session is to provide views on the state of telerobotics research and to draw together 
a collection of suggested research opportunities to present to NASA. The panel members will give 
opening statements summarizing the results of earlier sessions and discussion periods. This will be 
followed by a general discussion with the audience. The moderator will conclude the session by 
stating a synthesized set of recommendations. 

3:30 pm — 5:30 pm Closing Reception, The Pasadena Center Lobby 
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